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STREPTOCOCCUS PNEUMONIAE PROTEINS AND 
IMMUNOGENIC FRAGMENTSTFOR VACCINES 

This application is based on U.S. Provisional Application No. 
60/113,048, filed 21 December 1998, which is hereby incorporated in its 
entirety. 

FIELD OF THE INVENTION 

This invention relates generally to the field of bacterial antigens and their 
use, for example, as immunogenic agents in humans and animals to stimulate 
an immune response. More specifically, it relates to the vaccination of 
mammalian species with a polypeptide comprising at least one conserved 
histidine triad residue (HxxHxH) and at least one helix-forming polypeptide 
obtained from Streptococcus pneumoniae as a mechanism for stimulating 
production of antibodies that protect the vaccine recipient against infection by 
a wide range of serotypes of pathogenic S. pneumoniae. Further, the 
invention relates to antibodies against such polypeptides useful in diagnosis 
and passive immune therapy with respect to diagnosing and treating such 
pneumococcal infections. 

In a particular aspect, the present invention relates to the prevention and 
treatment of pneumococcal infections such as infections of the middle ear, 
nasopharynx, lung and bronchial areas, blood, CSF, and the like, that are 
caused by pneumococcal bacteria. 
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BACKGROUND OF THE INVENTION 

Streptococcus pneumoniae is a gram positive bacteria which is a major 
causative agent in invasive infections in animals and humans, such as sepsis, 
5 meningitis, otitis media and lobar pneumonia (Tuomanen et al. New Engl. J. 
Med. 322:1280-1284 (1995)). As part of the infective process, pneumococci 
readily bind to non-inflamed human epithelial cells of the upper and lower 
respiratory tract by binding to eukaryotic carbohydrates in a lectin-like manner 
(Cundell et al., Micro. Path. 17:361-374 (1994)). Conversion to invasive 

10 pneumococcal infections for bound bacteria may involve the local generation of 
inflammatory factors which may activate the epithelial cells to change the 
number and type of receptors on their surface (Cundell et al.. Nature, 
377:435-438 (1995)). Apparently, one such receptor, platelet activating factor 
(PAF) is engaged by the pneumococcal bacteria and within a very short period 

15 of time (minutes) from the appearance of PAF, pneumococci exhibit strongly 
enhanced adherence and invasion of tissue. Certain soluble receptor analogs 
have been shown to prevent the progression of pneumococcal infections 
(Idanpaan-Heikkila et al., J. Inf. Dis., 176:704-712 (1997)). A number of 
various other proteins have been suggested as being involved in the 

20 pathogenicity of S. pneumoniae. There remains a need for identifying 
polypeptides having epitopes in common from various strains of S. 
pneumoniae in order to utilize such polypeptides as vaccines to provide 
protection against a wide variety of S. pneumoniae. 

25 

SUMMARY OF INVENTION 

In accordance with the present invention, there is provided vaccines and 
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vaccine compositions that include polypeptides obtained from S. pneumoniae 
and/or variants of said polypeptides and/or active fragments of such 
polypeptides. 

5 The active fragments, as hereinafter defined, include a histidine triad 

residue(s) and/or coiled coil regions of such polypeptides. 

The term "percent identity" or "percent identical," when referring to a 
sequence, means that a sequence is compared to a claimed or described 
10 sequence from an alignment of the sequence to be compared (the "Compared 
Sequence") with the described or claimed sequence (the "Reference 
Sequence"). The percent identity is determined as follows: 

Percent Identity = [1- (C/R)l 100 

15 

wherein C is the number of differences between the Reference Sequence and 
the Compared Sequence over the length of the alignment between the 
Compared Sequence and the Reference Sequence wherein (i) each base or 
amino acid in the Reference Sequence that does not have an aligned base or 

20 amino acid in the Compared Sequence and (ii) each gap in the Reference 
Sequence and (iii) each aligned base or amino acid in the Reference Sequence 
that is different from an aligned base or amino acid in the Compared Sequence, 
each being a difference; and R is the number of bases or amino acids in the 
Reference Sequence over the length of the alignment with the Compared 

25 Sequence with any gap created in the Reference Sequence also being counted 
as a base or amino acid. 

If an alignment exists between the Compared Sequence and the 
Reference Sequence in which the Percent Identity as calculated above is about 
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equal to or greater than a specified minimum Percent Identity than the 
Compared Sequence has the specified minimum Percent Identity to the 
Reference Sequence even though alignments may exist in which the 
hereinabove calculated Percent Identity is less than the specified Percent 
5 Identity. 

"Isolated" in the context of the present invention with respect to 
polypeptides and/or polynucleotides means that the material is removed from 
its original environment (e.g., the natural environment if it is naturally 

10 occurring). For example, a naturally-occurring polynucleotide or polypeptide 
present in a living organism is not isolated, but the same polynucleotide or 
polypeptide, separated from some or all of the co-existing materials in the 
natural system, is isolated. Such polynucleotides could be part of a vector 
and/or such polynucleotides or polypeptides could be part of a composition, 

15 and still be isolated in that such vector or composition is not part of its natural 
environment. The polypeptides and polynucleotides of the present invention 
are preferably provided in an isolated form, and preferably are purified to 
homogeneity. 

20 

BRIEF DESCRIPTION OF DRAWINGS 

Figures 1A-1C, respectively, report the results of three experiments 
25 using different preparations of SP36. The results demonstrate that active 
immunization with recombinant SP36 derived from pneumococcal strain 
Norway serotype 4 is able to protect mice from death in a model of 
pneumococcal sepsis using a heterologous strain, SJ2 (serotype 6B). In each 
of the three experiments shown, one hundred percent of the mice immunized 
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with SP36 survived for the 14-clay observation period following challenge 
with approximately 500 cfu of pneumococci, while eighty to one hundred 
percent of sham-immunized mice (injected with PBS and adjuvant) died 
during the same period. 

5 

Figures 2A-2B show that passive administration of rabbit antiserum 
raised against Sp36 derived from Norway type 4 was able to protect mice in 
the pneumococcal sepsis model using two heterologous strains. Figure 2A 
shows that one hundred percent of the mice immunized with the SP36 

10 antiserum survived the 21-day observation period after challenge with 172 
CFU of strain SJ2 (serotype 6B). Eighty percent of the mice immunized 
with a control serum (rabbit anti-FimC) died by day 8, and ninety percent 
died by day 12. Figure 2B shows that 90 percent of the mice immunized 
with the Sp36 antiserum survived the 8-day observation after challenge with 

15 862 CFU of strain EF6796 (serotype 6A). Ninety percent of the mice 
immunized with a control serum (collected before immunization) died by day 
5. 

Figure 3 is a western blot demonstrating the ability of antisera raised 
20 against recombinant Sp36 derived from strain Norway type 4 to react with 
Sp36 of heterologous strains. Total cell lysates were immunoblotted with 
mouse antisera to Sp36. A band representing Sp36 protein was detected in 
all 23 S. pneumoniae strains tested, which included isolates from each of the 
23 pneumococcal serotypes represented in the current polysaccharide 
25 vaccine. 

Figure 4 is a Southern blot showing that the Sp36 gene from Norway 
type 4 hybridizes with genomic DNA from 24 other pneumococcal strains, 
indicating the presence of similar sequences in all these strains. 
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Figure 5 is a western blot showing the reactivity of patient sera with 
Sp36. Sp36 (either full-length, panel A; N-terminal half, panel B; or C- 
terminal half, panel C) was electrophoresed by SDS-PAGE and transferred to 
5 nitrocellulose. Patient sera collected soon after the onset of illness (acute 
serum, lanes A) or eight to 30 days later (convalescent serum, lanes C) were 
used to probe the blots. For patients 2, 3, and 5, convalescent serum 
reacted more strongly with Sp36 than did the corresponding acute serum. 

10 Figure 6 is an amino acid alignment comparison of four related 

pneumococcal proteins, namely Sp36A (PhtA; SEQ ID NO:8), Sp36B (PhtB; 
SEQ ID NO:10), Sp36D (PhtD; SEQ ID NO:4), Sp36E (PhtE; SEQ ID NO:6), 
respectively. Dashes in a sequence indicate gaps introduced to maximize the 
sequence similarity. Amino acid residues that match are boxed. 

15 

Figure 7 is a nucleotide alignment comparison of four related 
pneumococcal genes, namely Sp36A (PhtA; SEQ ID NO:9), Sp36B (PhtB; 
SEQ ID NO:11), Sp36D (PhtD; SEQ ID NO:5), Sp36E (PhtE; SEQ ID NO:7), 
respectively. Dashes in a sequence indicate gaps introduced to maximize the 
20 sequence similarity. 

Figure 8 shows the results of immunization of mice with PhtD 
recombinant protein, which leads to protection from lethal sepsis. C3H/HeJ 
(Panel A and B) or Balb/cByJ (Panel C) mice were immunized subcutaneously 
25 with PhtD protein (15 \xg in 50 \i\ PBS emulsified in 50 nl complete Freund's 
adjuvant (CFA)). The recombinant PhtD protein used in protection 
experiments consisted of 819 amino acid residues, starting with the cysteine 
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(residue 20). A group of 10 sham-immunized mice received PBS with 
adjuvant. A second immunization of 1 5 \xg protein with incomplete Freund's 
adjuvant (IFA) was administered 3 weeks later; the sham group received PBS 
with IFA. Blood was drawn (retro-orbital bleed) at week 7; and sera from 
5 each group was pooled for analysis of anti-PhtD antibody by ELISA. Mice 
were challenged at week 8 by an intraperitonial (i.p.) injection of 
approximately 550 CFU S. pneumoniae strain SJ2, serotype 6B (Panel A), 
850 CFU of strain EF6796, serotype 6A (Panel B) or 450 CFU of strain 
EF5668, serotype 4 (Panel C). In preliminary experiments, the LD 50 for strain 

10 SJ2 and EF6796 were determined to be approximately 10 CFU for both 
strains. The LD 50 for strain EF5668 was determined to be < 5 CFU. Survival 
was determined in all groups over the course of 15 days following challenge. 
Data are presented as the percent survival for a total of 10 mice per 
experimental group. Two-sample Log-rank test was used for statistical 

15 analysis comparing recombinant Pht immunized mice to sham-immunized 
mice. 



SUMMARY OF THE INVENTION 

20 

In accordance with one aspect of the present invention, there is 
provided a vaccine, generally in the form of a composition, that includes at 
least one polypeptide that is at least 90% identical to (c) a polypeptide 



7 



WO 00/37105 



PCT/US99/30390 



sequence comprising amino acids 1-819 of SEQ ID NO:4 or (ii) a polypeptide 
sequence comprising amino acids 1-460 of SEQ ID NO:6 or an active fragment 
of the foregoing. 

In accordance with another aspect of the present invention, there is 
provided a vaccine, generally in the form of a composition, that includes an 
active fragment of a polypeptide that is at least 90% identical to (i) a 
polypeptide comprising amino acids 1-800 of SEQ ID NO:8 or (ii) a polypeptide 
comprising amino acids 1-800 of SEQ ID NO:10. 

The term "active fragment" means a fragment that includes one or more 
histidine triad residues and/or one or more coiled coil regions. A "histidine 
triad residue" is the portion of the polypeptide that has the sequence HxxHxH 
wherein H is histidine and x is an amino acid other than histidine 

A coiled coil region is the region predicted by "Coils" algorithm: Lupas, 
A., Van Dyke, M., and Stock, J. (1991) Predicting Coiled Coils from Protein 
Sequences, Science 252:1 1 62-1 1 64. 

In accordance with one embodiment, the active fragment includes both 
one or more histidine triad residues and at least one coiled coil region of the 
applicable polypeptide sequence. In accordance with another embodiment, the 
active fragment includes at least two histidine triad residues. 

In another embodiment, the active fragment that includes at least one 
histidine triad residue or at least one coiled-coil region of the applicable 
polypeptide includes at least about ten percent of the applicable polypeptide 
and no more than about 85% of the applicable polypeptide. 
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The polypeptide of SEQ ID NO:4 includes five histidine triad residues, as 
follows: 

amino acids 64-69; 188-193; 296-301; 541-546; and 625-630. 

The polypeptide of SEQ ID NO:6 includes five histidine triad residues, as 
follows: 

amino acids 63-68; 185-190; 289-294, 376-381; and 441-446. 

In addition, the polypeptide of SEQ ID NO:4 includes two coiled-coil 
regions (amino acids 120-140 and amino acids 750-772) and the polypeptide 
of SEQ ID NO:6 includes one coiled-coil region (amino acids 1 19-152). 

The polypeptide of SEQ ID NO: 8 includes the following regions: 

HxxHxH: amino acids 63-68, 189-194, 309-314, 550-555, 634-639. 
Coiled-coils: amino acids 1 18-145, 406-434, 462-493, 724-751 . 

In accordance with a further aspect of the invention, a vaccine of the 
type hereinabove described is administered for the purpose of preventing or 
treating infection caused by S, pneumoniae. 

A vaccine, or vaccine composition, in accordance with the present 
invention may include one or more of the hereinabove described polypeptides 
or active fragments thereof. When employing more than one polypeptide or 
active fragment, such two or more polypeptides and/or active fragments may 
be used as a physical mixture or as a fusion of two or more polypeptides or 
active fragments. The fusion fragment or fusion polypeptide may be produced, 
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for example> by recombinant techniques or by the use of appropriate linkers for 
fusing previously prepared polypeptides or active fragments. 

In an embodiment of the invention, there is provided (a) a polypeptide 
5 that is at least 95% identical or at least 97% identical or 100% identical to (i) 
a polypeptide sequence comprising amino acids 1 to 819 of SEQ ID N0:4 or 
(ii) a polypeptide sequence comprising amino acids 1-460 of SEQ ID NO:6; or 
(b) an active fragment of the polypeptide of (a). 

10 In the case where the polypeptide is a variant of the polypeptide 

comprising the mature polypeptide of SEQ ID NO:4 or SEQ ID NO:6, or any of 
the active fragments of the invention, the variation in the polypeptide or 
fragment is generally in a portion thereof other than the histidine triad residues 
and the coiled-coil region, although variations in one or more of these regions 

15 may be made. 

In many cases, the variation in the polypeptide or active fragment is a 
conservative amino acid substitution, although other substitutions are within 
the scope of the invention. 

20 

In accordance with the present invention, a polypeptide variant includes 
variants in which one or more amino acids are substituted and/or deleted 
and/or inserted. 

25 In another aspect, the invention relates to passive immunity vaccines 

formulated from antibodies against a polypeptide or active fragment of a 
polypeptide of the present invention. Such passive immunity vaccines can be 
utilized to prevent and/or treat pneumococcal infections in patients. In this 
manner, according to a further aspect of the invention, a vaccine can be 

10 
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produced from a synthetic or recombinant polypeptide of the present invention 
or an antibody against such polypeptide. 

In still another aspect the present invention relates to a method of using 
5 one or more antibodies (monoclonal, polyclonal or sera) to the polypeptides of 
the invention as described above for the prophylaxis and/or treatment of 
diseases that are caused by pneumococcal bacteria. In particular, the 
invention relates to a method for the prophylaxis and/or treatment of infectious 
diseases that are caused by 5. pneumoniae. In a still further preferred aspect, 
10 the invention relates to a method for the prophylaxis and/or treatment of otitis 
media, nasopharyngeal, bronchial infections, and the like in humans by utilizing 
a vaccine of the present invention. 

Generally, vaccines are prepared as injectables, in the form of aqueous 
15 solutions or suspensions. Vaccines in an oil base are also well known such as 
for inhaling. Solid forms which are dissolved or suspended prior to use may 
also be formulated. Pharmaceutical carriers are generally added that are 
compatible with the active ingredients and acceptable for pharmaceutical use. 
Examples of such carriers include, but are not limited to, water, saline 
20 solutions, dextrose, or glycerol. Combinations of carriers may also be used. 

Vaccine compositions may further incorporate additional substances to 
stabilize pH, or to function as adjuvants, wetting agents, or emulsifying 
agents, which can serve to improve the effectiveness of the vaccine. 

25 

Vaccines are generally formulated for parental administration and are 
injected either subcutaneously or intramuscularly. Such vaccines can also be 
formulated as suppositories or for oral administration, using methods known in 
the art. 



11 



WO 00/37105 



PCT/US99/30390 



The amount of vaccine sufficient to confer immunity to pathogenic 
bacteria is determined by methods well known to those skilled in the art. This 
quantity will be determined based upon the characteristics of the vaccine 
5 recipient and the level of immunity required. Typically, the amount of vaccine 
to be administered will be determined based upon the judgment of a skilled 
physician. Where vaccines are administered by subcutaneous or intramuscular 
injection, a range of 50 to 500 \xg purified protein may be given. 

10 The present invention is also directed to a vaccine in which a 

polypeptide or active fragment of the present invention is delivered or 
administered in the form of a polynucleotide encoding the polypeptide or active 
fragment, whereby the polypeptide or active fragment is produced in vivo. 
The polynucleotide may be included in a suitable expression vector and 

15 combined with a pharmaceutical^ acceptable carrier. 

In addition, the polypeptides of the present invention can be used as 
immunogens to stimulate the production of antibodies for use in passive 
immunotherapy, for use as diagnostic reagents, and for use as reagents in 
20 other processes such as affinity chromatography. 

In another aspect the present invention provides polynucleotides which 
encode the hereinabove described polypeptides and active fragments of the 
invention. The polynucleotide of the present invention may be in the form of 
25 RNA or in the form of DNA, which DNA includes cDNA, genomic DNA, and 
synthetic DNA. The DNA may be double-stranded or single-stranded, and if 
single stranded may be the coding strand or non-coding (anti-sense) strand. 

In accordance with another aspect of the present invention, there is 

12 
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provided 

(A) an isolated polynucleotide that is at least 90% identical to a 
polynucleotide sequence encoding (i) a polypeptide comprising amino acids 1- 
819 of SEQ ID NO:4 or (ii) a polypeptide comprising amino acids 1-460 of SEQ 
ID NO:6, or 

(B) a fragment of the polynucleotide of (A) that encodes an active 
polypeptide fragment or 

(C) a polynucleotide that is at least 90% identical to a 
polynucleotide sequence encoding an active fragment of (i) a polypeptide 
comprising amino acids 1 to 800 of SEQ ID NO:8 or (ii) a polypeptide 
comprising amino acids 1 to 800 of SEQ ID NO: 10. 

In specific embodiments, the polynucleotide is at least 95% identical, 
preferably at least 97% identical, and even 100% identical to such 
polynucleotide sequence. 

The term "polynucleotide encoding a polypeptide" encompasses a 
polynucleotide which includes only coding sequence for the polypeptide as well 
as a polynucleotide which includes additional coding and/or non-coding 
sequence. 

The present invention further relates to variants of polynucleotides. The 
variants of the polynucleotides may be a naturally occurring allelic variant of 
the polynucleotides or a non-naturally occurring variant of the polynucleotides. 
The variants include variants in which one or more bases are substituted, 
deleted or inserted. Complements to such coding polynucleotides may be 
utilized to isolate polynucleotides encoding the same or similar polypeptides. 
In particular, such procedures are useful to obtain native immunogenic portions 
of polypeptides from different serotypes of S. pneumoniae, which is especially 
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useful in the production of "chain" polypeptide vaccines containing multiple 
immunogenic segments. 

SEQ ID IMO:5 is a representative example of a polynucleotide encoding 
5 the polypeptide of SEQ ID N0:4 and SEQ ID NO:7 is a representative example 
of a polynucleotide encoding the polypeptide of SEQ ID NO:6. SEQ ID NO:9 is 
a representative example of a polynucleotide encoding the polypeptide of SEQ 
ID NO:8, and SEQ ID NO:11 is a representative example of a polynucleotide 
encoding the polypeptide of SEQ ID NO: 10. As a result of the known 
10 degeneracy of the genetic code, other polynucleotides that encode the 
polypeptides of SEQ ID N0:4, SEQ ID NO:6, SEQ ID NO:8 and SEQ ID NO: 10 
should be apparent to those skilled in the art from the teachings herein. 

The polynucleotides encoding the immunogenic polypeptides described 
15 above may also have the coding sequence fused in frame to a marker 
sequence which allows for purification of the polypeptides of the present 
invention. The marker sequence may be, for example, a hexa-histidine tag 
supplied by a pQE-9 vector to provide for purification of the mature 
polypeptides fused to the marker in the case of a bacterial host, or, for 
20 example, the marker sequence may be a hemagglutinin (HA) tag when a 
mammalian host, e.g. COS-7 cells, is used. The HA tag corresponds to an 
epitope derived from the influenza hemagglutinin protein (Wilson, I., et al., Cell, 
37:767 (1984)). 

25 The present invention also relates to vectors which include 

polynucleotides encoding one or more of the polypeptides of the invention, 
host cells which are genetically engineered with vectors of the invention and 
the production of such immunogenic polypeptides by recombinant techniques 
in an isolated and substantially immunogenically pure form. 

14 
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Host cells are genetically engineered (transduced or transformed or 
transfected) with the vectors comprising a polynucleotide encoding a 
polypeptide of the invention. The vector may be, for example, in the form of a 
5 plasmid, a viral particle, a phage, etc. The engineered host cells can be 
cultured in conventional nutrient media modified as appropriate for activating 
promoters, selecting transformants or amplifying the polynucleotides which 
encode such polypeptides. The culture conditions, such as temperature, pH 
and the like, are those previously used with the host cell selected for 
10 expression, and will be apparent to the ordinarily skilled artisan. 

Vectors include chromosomal, nonchromosomal and synthetic DNA 
sequences, e.g., derivatives of SV40; bacterial plasmids; phage DNA; 
baculovirus; yeast plasmids; vectors derived from combinations of plasmids 
15 and phage DNA, viral DNA such as vaccinia, adenovirus, fowl pox virus, and 
pseudorabies. However, any other vector may be used as long as it is 
replicable and viable in the host. 

The appropriate DNA sequence may be inserted into the vector by a 
20 variety of procedures. In general, the DNA sequence is inserted into an 
appropriate restriction endonuclease site(s) by procedures known in the art. 
Such procedures and others are deemed to be within the scope of those skilled 
in the art. 

25 The DNA sequence in the expression vector is operatively linked to an 

appropriate expression control sequence(s) (promoter) to direct mRNA 
synthesis. As representative examples of such promoters, there may be 
mentioned: LTR or SV40 promoter, the E. coli. lac or t[£, the phage lambda P L 
promoter and other promoters known to control expression of genes in 

15 
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prokaryotic or eukaryotic cells or their viruses. The expression vector also 
contains a ribosome binding site for translation initiation and a transcription 
terminator. The vector may also include appropriate sequences for amplifying 
expression. 

5 

In addition, the expression vectors preferably contain one or more 
selectable marker genes to provide a phenotypic trait for selection of 
transformed host cells such as dihydrofolate reductase or neomycin resistance 
for eukaryotic cell culture, or such as tetracycline or ampicillin resistance in E 
10 coli . 

The vector containing the appropriate DNA sequence as hereinabove 
described, as well as an appropriate promoter or control sequence, may be 
employed to transform an appropriate host to permit the host to express the 
15 proteins. 

As representative examples of appropriate hosts, there may be 
mentioned: bacterial cells, such as E. coli , Streptomyces , Salmonella 
typhimurium ; fungal cells, such as yeast; insect cells such as Drosophila S2 
20 and Spodoptera Sf9 ; animal cells such as CHO, COS or Bowes melanoma; 
adenoviruses; plant cells, etc. The selection of an appropriate host is deemed 
to be within the scope of those skilled in the art from the teachings herein. 

More particularly, the present invention also includes recombinant 
25 constructs comprising one or more of the sequences as broadly described 
above. The constructs comprise a vector, such as a plasmid or viral vector, 
into which a sequence of the invention has been inserted, in a forward or 
reverse orientation. In a preferred aspect of this embodiment, the construct 
further comprises regulatory sequences, including, for example, a promoter, 
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operably linked to the sequence. Large numbers of suitable vectors and 
promoters are known to those of skill in the art, and are commercially 
available. The following vectors are provided by way of example. Bacterial: 
pQE70, pQE60, pQE-9 (Qiagen, Inc.), pbs, pD10, phagescript, psiX174, 
pbluescript SK, pbsks, pNH8A, pNH16a, pNH18A, pNH46A (Stratagene); 
ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). Eukaryotic: 
pWLNEO, pSV2CAT, pOG44, pXT1, pSG (Stratagene) pSVK3, pBPV, pMSG, 
pSVL (Pharmacia). However, any other plasmid or vector may be used as long 
as they are replicable and viable in the host. 

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol transferase) vectors or other vectors with selectable markers. 
Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial 
promoters include lad, lacZ, T3, T7, gpt, lambda P R , P L and TRP. Eukaryotic 
promoters include CMV immediate early, HSV thymidine kinase, early and late 
SV40, LTRs from retrovirus, and mouse metallothionein-l. Selection of the 
appropriate vector and promoter is well within the level of ordinary skill in the 
art. 

In a further embodiment, the present invention relates to host cells 
containing the above-described constructs. The host cell can be a higher 
eukaryotic cell, such as a mammalian cell, or a lower eukaryotic cell, such as a 
yeast cell, or the host cell can be a prokaryotic cell, such as a bacterial cell. 
Introduction of the construct into the host cell can be effected by calcium 
phosphate transfection, DEAE-Dextran mediated transfection, or 
electroporation (Davis, L, Dibner, M., Battey, I., Basic Methods in Molecular 
Biology, (1986)). 

The constructs in host cells can be used in a conventional manner to 
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produce the gene product encoded by the recombinant sequence. 
Alternatively, the polypeptides of the invention can be synthetically produced 
by conventional peptide synthesizers. 

5 Mature proteins can be expressed in mammalian cells, yeast, bacteria, 

or other cells under the control of appropriate promoters. Cell-free translation 
systems can also be employed to produce such proteins using RNAs derived 
from the DNA constructs of the present invention. Appropriate cloning and 
expression vectors for use with prokaryotic and eukaryotic hosts are described 
10 by Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second Edition, 
Cold Spring Harbor, N.Y., (1989), the disclosure of which is hereby 
incorporated by reference. 

Transcription of the DNA encoding the polypeptides of the present 
15 invention by higher eukaryotes is increased by inserting an enhancer sequence 
into the vector. Enhancers are cis-acting elements of DNA, usually about from 
10 to 300 bp that act on a promoter to increase its transcription. Examples 
including the SV40 enhancer on the late side of the replication origin bp 1 00 to 
270, a cytomegalovirus early promoter enhancer, the polyoma enhancer on the 
2 0 late side of the replication origin, and adenovirus enhancers. 

Generally, recombinant expression vectors will include origins of 
replication and selectable markers permitting transformation of the host cell, 
e.g., the ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and 
25 a promoter derived from a highly-expressed gene to direct transcription of a 
downstream structural sequence. Such promoters can be derived from operons 
encoding glycolytic enzymes such as 3-phosphogly cerate kinase (PGK), a- 
factor, acid phosphatase, or heat shock proteins, among others. The 
heterologous structural sequence is assembled in appropriate phase with 
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translation initiation and termination sequences. Optionally, the heterologous 
sequence can encode a fusion protein including an N-terminal identification 
peptide imparting desired characteristics, e.g., stabilization or simplified 
purification of expressed recombinant product. 

5 

Useful expression vectors for bacterial use are constructed by inserting 
a structural DNA sequence encoding a desired protein together with suitable 
translation initiation and termination signals in operable reading phase with a 
functional promoter. The vector will comprise one or more phenotypic 

10 selectable markers and an origin of replication to ensure maintenance of the 
vector and to, if desirable, provide amplification within the host. Suitable 
prokaryotic hosts for transformation include E. coli , Bacillus subtilis , Salmonella 
typhimurium and various species within the genera Pseudomonas, 
Streptomyces, and Staphylococcus, although others may also be employed as 

15 a matter of choice. 

As a representative but nonlimiting example, useful expression vectors 
for bacterial use can comprise a selectable marker and bacterial origin of 
replication derived from commercially available plasmids comprising genetic 
20 elements of the well known cloning vector pBR322 (ATCC 37017). Such 
commercial vectors include, for example, pKK223-3 (Pharmacia Fine 
Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec, Madison, Wl, USA). 
These pBR322 "backbone" sections are combined with an appropriate 
promoter and the structural sequence to be expressed. 

25 

Following transformation of a suitable host strain and growth of the 
host strain to an appropriate cell density, the selected promoter is induced by 
appropriate means (e.g., temperature shift or chemical induction) and cells are 
cultured for an additional period. 
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Cells are typically harvested by centrifugation, disrupted by physical or 
chemical means, and the resulting crude extract retained for further 
purification. 

5 

Microbial cells employed in expression of proteins can be disrupted by 
any convenient method, including freeze-thaw cycling, sonication, a french 
press, mechanical disruption, or use of cell lysing agents, such methods are 
well know to those skilled in the art. However, preferred are host cells which 
10 secrete the polypeptide of the invention and permit recovery of the polypeptide 
from the culture media. 

Various mammalian cell culture systems can also be employed to 
express recombinant protein. Examples of mammalian expression systems 

15 include the COS-7 lines of monkey kidney fibroblasts, described by Gluzman, 
Cell, 23:175 (1981), and other cell lines capable of expressing a compatible 
vector, for example, the C127, 3T3, CHO, HeLa and BHK cell lines. 
Mammalian expression vectors will comprise an origin of replication, a suitable 
promoter and enhancer, and also any necessary ribosome binding sites, 

20 polyadenylation site, splice donor and acceptor sites, transcriptional 
termination sequences, and 5' flanking nontranscribed sequences. DNA 
sequences derived from the SV40 splice, and polyadenylation sites may be 
used to provide the required nontranscribed genetic elements. 

25 The polypeptides can be recovered and/or purified from recombinant cell 

cultures by well-known protein recovery and purification methods. Such 
methodology may include ammonium sulfate or ethanol precipitation, acid 
extraction, anion or cation exchange chromatography, phosphocellulose 
chromatography, hydrophobic interaction chromatography, affinity 
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chromatography, hydroxylapatite chromatography and lectin chromatography. 
Protein refolding steps can be used, as necessary, in completing configuration 
of the mature protein. In this respect, chaperones may be used in such a 
refolding procedure. Finally, high performance liquid chromatography (HPLC) 
5 can be employed for final purification steps. 

The polypeptides that are useful as immunogens in the present invention 
may be a naturally purified product, or a product of chemical synthetic 
procedures, or produced by recombinant techniques from a prokaryotic or 
10 eukaryotic host (for example, by bacterial, yeast, higher plant, insect and 
mammalian cells in culture). Depending upon the host employed in a 
recombinant production procedure, the polypeptides of the present invention 
may be glycosylated or may be non-glycosylated. 

15 Procedures for the isolation of the individually expressed polypeptides 

may be isolated by recombinant expression/isolation methods that are well- 
known in the art. Typical examples for such isolation may utilize an antibody 
to a conserved area of the protein or to a His tag or cleavable leader or tail that 
is expressed as part of the protein structure. 

20 

The polypeptides, their fragments or other derivatives, or analogs 
thereof, or cells expressing them can be used as an immunogen to produce 
antibodies thereto. These antibodies can be, for example, polyclonal or 
monoclonal antibodies. The present invention also includes chimeric, single 
25 chain, and humanized antibodies, as welt as Fab fragments, or the product of 
an Fab expression library. Various procedures known in the art may be used 
for the production of such antibodies and fragments. 

Antibodies generated against the polypeptides corresponding to a 
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sequence of the present invention can be obtained by direct injection of the 
polypeptides into an animal. 

For preparation of monoclonal antibodies, any technique which provides 
5 antibodies produced by continuous cell line cultures can be used. Examples 
include the hybridoma technique (Kohler and Milstein, 1975, Nature, 256:495- 
497), the trioma technique, the human B-cell hybridoma technique (Kozbor et 
al., 1983, Immunology Today 4:72), and the EBV-hybridoma technique to 
produce human monoclonal antibodies (Cole, et al., 1985, in Monoclonal 
10 Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

Techniques described for the production of single chain antibodies (U.S. 
Patent 4,946,778) can be adapted to produce single chain antibodies to 
immunogenic polypeptide products of this invention. Also, transgenic mice 
may be used to express humanized antibodies to immunogenic polypeptide 
15 products of this invention. 

The invention will be further described with respect to the following 
examples; however, the scope of the invention is not limited thereby: 

20 Example 1 

Active Protection with Anti-Sp36 

A Cloning, expression, and purification of SP36 

25 The genomic DNA used as target for amplification was isolated from 

S. pneumoniae Norway strain (serotype 4), the same strain used for genomic 
sequencing. The complete sequence of the Sp36 gene (SEQ ID NO:9), and 
its predicted amino acid sequence (SEQ ID NO:8), are given in the Sequence 
Listing appended hereto. It was noted that the predicted amino acid 
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sequence included a hydrophobic leader sequence followed by a sequence 
(LSVC) similar to the consensus sequence for Type II signal peptidase (LxxC, 
in which both x's typically represent small amino acids). Primers (listed as 
SEQ ID NOS:1-3) were designed that would amplify the Sp36 gene and allow 
its cloning into P QE10 and expression as a histidine-tagged protein lacking 
the signal sequence for purification by nickel-affinity chromatography. 
Cloning of the fragment amplified by SEQ ID Nos 1 and 3 would result in a 
protein containing amino acids 2 through 800 of Sp36; cloning of the 
fragment amplified by SEQ ID Nos 2 and 3 would result in a protein 
containing amino acids 7 through 800 of Sp36 (amino acid numbers refer to 
SEQ ID NO:8). 

B. Active Protection With Sp36 Vaccination 

In each of the three experiments shown in Figures 1A-1C, C3H/HeJ 
mice (10/group) were immunized intraperitoneally (i.p.) with Sp36 protein 
(15 ng in 50 jxl PBS emulsified in 50 \i\ complete Freund's adjuvant (CFA)). A 
group of 10 sham-immunized mice received PBS with adjuvant. A second 
immunization of 15 ng protein with incomplete Freund's adjuvant (IFA) was 
administered 4 weeks later; the sham group received PBS with IFA. Blood 
was drawn (retro-orbital bleed) at weeks 3. 6, and 9; and sera from each 
group were pooled for analysis of anti-S P 36 antibody by ELISA. Mice were 
challenged at week 10 by an i.p. injection of approximately 500 CFU S. 
pneumoniae strain SJ2 (serotype 6B; provided by P. Flynn, St. Jude 
Children's Research Hospital, Memphis, TN). In preliminary experiments, the 
LD 50 of this strain was determined to be approximately 10 CFU. Mice were 
monitored for 14 days for survival. 

The three experiments shown in Figures 1A-1C used slightly different 
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preparations of recombinant S P 36. The experiments shown in Figure 1A and 
1B both used Sp36 containing amino acids 20-815, but different batches of 
protein were used in the two experiments. The experiment shown in Rgure 
1C used Sp36 containing amino acids 25-815. 

in the experiment shown in Figure 1 A, 9-week sera collected from the 
ten mice immunized with S P 36 (first batch) had an endpoint ELISA fter of 
1-4,096,000. No anti-S P 36 antibody was detected in sera from sham- 
immunized mice. One hundred percent of the mice immunized with Sp36 
0 protein survived the challenge (520 cfu of pneumococci) for 14 days. E.ghty 
percent of sham-immunized mice were dead by day 4, and the rema.nder 
survived. 

In the experiment shown in Figure 1B, 9-week sera collected from the 
L5 ten mice immunized with Sp36 (second batch) had an endpoint ELISA titer of 
> 1:4,096,000. No anti-S P 36 antibody was detected in sera from sham- 
immunized mice. One hundred percent of the mice immunized with Sp36 
protein survived the challenge (510 cfu of pneumococci) for 14 days. Of the 
sham-immunized mice, eighty percent were dead by day 4. and all died by 
20 day 9. 

In the experiment shown in Figure 1C, 9-week sera collected from the 
ten mice immunized with Sp36 (containing amino acids 25- 815) had an 
endpoint ELISA titer of 1 :4,096,000. No anti-S P 36 antibody was detected in 
25 sera from sham-immunized mice. One hundred percent of the mice 
immunized with Sp36 protein survived the challenge (510 cfu of 
pneumococci) for 14 days. Of the sham-immunized mice, ninety percent died 
by day 4, and all died by day 12. These data demonstrate that immunization 
of mice with recombinant Sp36 proteins elicits a response capable of 
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protecting against systemic pneumococcal infection and death. This 
protection was not strain-specific: the recombinant pneumococcal prote.n 
was cloned from a serotype 4 strain, while the challenge was w.th a 
heterologous strain, SJ2 (serotype 6B). 

Example 2 

Passive Protection with Anti-Sp 36 Antisera 
A. Generation of Rabbit Immune Sera 

Following collection of preimmune serum, a New Zealand White rabbit 
was immunized with 250 M g of S P 36 (containing amino acids 20-815) in 
CFA. The rabbit was given two boosts of 125 ug Sp36 in IFA on days 29 
and 50 and bled on days 39 and 60. A second rabbit was immunized with a 
control antigen, E. coli FimC. 

B. Passive Protection in Mice 

C3H/HeJ mice (10 mice/group) were passively immunized by two i.p. 
injections of 100 pi of rabbit serum. The first injection was administered 
twenty-four hours before challenge with 172 cfu of S. pneumoniae strain 
SJ2, and the second injection was given four hours after challenge. Figure 2 
shows the survival of mice after infection with two different strains of 
pneumococci. 

Figure 2A shows that of mice injected with 172 cfu of strain SJ2 
(Figure 2A), one hundred percent of the mice immunized with rabbit immune 
serum raised against Sp36 protein survived the 21 -day observation period. 
Of the mice immunized with the control serum (anti-FimC), eighty percent 
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died by day 8, and ninety percent died by day 12. Figure 2B shows that of 
mice injected with 862 cfu of strain EF6796, ninety percent of the m.ce 
immunized with rabbit immune serum raised against S P 36 protein surv.ved 
the 8-day observation period. Of those given a control serum (collected from 
a rabbit before immunization), ninety percent died by day 8. 

These data indicate that the protection against pneumococcal infection 
resulting from immunization with Sp36 is antibody-mediated, since mice can 
be protected by passive transfer of serum from a hyperimmunized rabbit. As 
seen in the mouse active challenge experiments described above, serum 
directed against recombinant Sp36 protein cloned from a serotype 4 strain 
was protective against challenge with heterologous strains. 



Example 3 

Pnnsfirvation of Sp36 Among Strains of S. pneumoniae 



A. Western blotting 



The 23 pneumococcal strains used in this experiment were obtained 
from the American Type Culture Collection (Rockville. MD) and include one 
isolate each of the 23 serotypes in the multivalent pneumococcal vaccine. 
For total cell lysates, pneumococci were grown to mid-logarithmic phase 
(optical density at 620 nm. 0.4 to 0.6) in 2 ml Todd-Hewitt broth with 0.5% 
yeast extract (Difco. Detroit, ME) at 37°C. Bacteria were harvested by 
centrifugation and washed twice with water. Pellets were resuspended in 
200 nl lysis buffer (0.01% sodium dodecyl sulfate, 0.15 M sodium citrate 
and 0.1% sodium deoxycholate) and incubated at 37°C for 30 min, then 
diluted in an equal volume 2x SSC (0.3 M sodium chloride, 0.03 M sodium 
citrate). Lysates were separated by SDS-PAGE, transferred to nitrocellulose 
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membranes (Bio-Rad Laboratories, Hercules, CA), and probed with antibody 
in a standard Western blotting procedure. Sera from ten C3H/HeJ m.ce 
immunized with Sp36 (as described in Example 1) were pooled and used at a 
dilution of 1:3000. Bound antibody was detected with peroxidase- 
conjugated sheep anti- mouse IgG using the chemiluminescence kit from 
Amersham, Inc. (Cambridge, MA). 

The mouse anti-S P 36 sera detected two major bands with apparent 
molecular weights of 97 and 100 kDa in all 23 pneumococcal lysates tested 
(shown in Figure 3). The Sp36 signals obtained from S. pneumoniae 
serotypes 1, 5, 17F and 22F were lower, indicating either that the level of 
Sp36 expression is reduced in these strains, or that Sp36 in these strains is 
antigenically different. 

These data show that Sp36 is antigenically conserved among strains 
of the 23 pneumococcal serotypes represented in the current polysaccharide 
vaccine. 



B. Southern blotting 

Genomic DNA was prepared from each of the 23 pneumococcal 
strains listed in the previous section and also from strain SJ2. DNA was 
digested with PvuW and SaroHl, electrophoresed in an agarose gel and 
transferred to a nylon membrane. A probe was prepared by amplifying the 
Sp36 gene from Norway type 4 DNA (as in Example 1) and labeling the 
amplified fragment with fluorescein by the random-priming method, using a 
kit from Amersham. Hybridization, washing, and exposure of film were 
carried out as in the protocol supplied by Amersham. Figure 4 shows that 
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the Sp36 probe hybridized with DNA from each of the 24 strains studied. 
The lane marked "M" contained DNA from lambda phage, digested w.th 
HindM and labeled with fluorescein, as molecular weight markers. 

Example 4 

immuno genicity of Sp36 in Humans 

In order to determine whether Sp36 is immunogenic during human 
pneumococcal infection, sera from patients with culture-proven 
pneumococcal bacteremia were used in Western blots conta.n.ng 
recombinant Sp36 protein. In the experiment shown in Figure 5, sera from 
five patients (indicated as 1 through 5) were diluted 1:3000 and used to 
probe blots containing full-length Sp36, the N-terminal half of Sp36 
(preceding the proline-rich region), or the C-terminal half of Sp36 (following 
the proline-rich region). Lanes labeled A (acute) were probed with serum 
collected shortly after diagnosis of pneumococcal infection; lanes C 
(convalescent) were probed with serum collected either one month later 
(patients 1, 2, and 3) or eight days after the first serum collection (patients 4 
and 5). For patients 2, 3 and 5, reactivity of the convalescent serum with 
0 Sp36 was stronger that that of the corresponding acute serum. The 
difference between the acute and convalescent sera was particularly evident 
for reactivity with the C-terminal half of the protein. 

In additional experiments (not shown), convalescent sera from 23 
15 patients with pneumococcal infections were tested individually for reactivity 
with full-length Sp36: 20 of the 23 sera were found to bind Sp36 on a 
Western blot. 

These experiments indicate that Sp36 is recognized by the human 
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immune system and suggest that antibodies able to bind the Sp36 protein 
may be produced during natural S. pneumoniae infection in humans. S.nce 
the patients were infected with a variety of pneumococcal strains, these data 
also support the idea that Sp36 is antigenically conserved. 

Example 5 

Table 1 provides the percent identity between the various sequences. 

Alignment of the predicted amino acid sequences of PhtA, PhtB, PhtD, 
and PhtE using the MEG ALIGN program of Lasergene showed strong N- 
terminal homology with substantial divergence of the C-termini (Figure 6). 
The alignment of the nucleotide sequences of the same genes is shown in 
Figure 7. Amino acid and nucleotide sequences were compared using the 
identity weighting in a Lipman-Pearson pairwise alignment, in which the 
number of matching residues is divided by the total of matching residues plus 
the number of mismatched residues plus the number of residues in gaps. In 
the table below, the percent identity between each pair of sequences is 
shown at the intersection of the corresponding row and column. 

Example 6 

Active Protection with PhtD Vaccination. 

Mice immunized with recombinant PhtD derived from strain N4 
generated potent antibody titers (reciprocal endpoint titers ranging form 
2,048,00 to 4,096,000). Mice immunized with PhtD were protected against 
death following intraperitoneal injection with either of three heterologous 
strains, SJ2 (serotype 6B; provided by P. Flynn, St. Jude Children's Research 
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Hospital, Memphis, TN), EF6796 (serotype 6A) or EF5668 (serotype 4; both 
strains provided by D. Briles, University of Alabama, Birmingham). In the 
experiment shown in Figure 8 (Panel A), all ten of the sham-immunized mice 
died within 10-days after challenge with virulent pneumococci (strain SJ2), 
while eighty percent of the PhtD-immunized mice survived the 15-day 
observation period. Immunization with PhtD also protected against a serotype 
6A strain, EF6796 (Panel B) and a serotype 4 strain, EF5668 (Panel C). In 
the experiment shown in Figure 8 (Panel B), all ten of the sham-immunized 
mice died within 7-days after challenge with virulent pneumococci (strain 
EF6796), while ninety percent of the PhtD-immunized mice survived the 15- 
day observation period. In the experiment shown in Figure 8 (Panel C), all ten 
of the sham-immunized mice died within 6-days after challenge with virulent 
pneumoccoci (strain EF5668), while eight of nine mice immunized with PhtD 
survived the 15-day observation period. 
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Table 1 . Percent Identities 




Percent Identity Between Amino Acid Sequences 


PhtA PhtB PhtD 


PhtE 


PhtA — 66.4 63.9 


49.5 


PhtB — 87.2 


49.5 


PhtD 


49.8 


PhtE 




Percent Identity Between Nucleotide Sequences 


PhtA PhtB PhtD 


PhtE 


PhtA ~- 58.3 59.3 


47.9 


PhtB «- 86.4 


47.4 


PhtD 


47.9 


PhtE 
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1 . A vaccine composition comprising: 

(a) at least one member selected from the groups consisting 
5 of (i) a polypeptide comprising a polypeptide sequence that is at least 90% 

identical to amino acids 1-819 of SEQ ID N0:4; (ii) a polypeptide comprising 
a polypeptide sequence that is at least 90% identical to amino acids 1-460 
of SEQ ID NO:6; (iii) a fragment of the polypeptide of (i) that includes at least 
one of a histidine triad residue or coiled-coil region; (iv) a fragment of the 

10 polypeptide of (ii) that includes at least one of a histidine triad residue or a 
coiled-coil region; (v) a fragment of a polypeptide that is at least 90% 
identical to the polypeptide sequence comprising amino acids 1-800 of SEQ 
ID NO:8, wherein said fragment includes at least one of a histidine triad 
residue or coiled-coil region wherein said fragment includes at least 80 amino 

15 acids and no more than 680 amino acids; and (vi) a fragment of a 
polypeptide that is at least 90% identical to the polypeptide sequence 
comprising amino acids 1-800 of SEQ ID NO: 10, wherein said fragment 
includes at least one of a histidine triad residue or coiled-coil region wherein 
said fragment includes at least 80 amino acids and no more than 680 amino 

20 acids; and 

(b) a pharmaceutical^ acceptable carrier. 

2. A process for preventing infection caused by S. pneumoniae 
comprising: 

25 administering the vaccine of claim 1 . 

3. A vaccine composition comprising: 

(a) at least one antibody against a member selected from the 
group consistng of (i) a polypeptide comprising a polypeptide sequence that 
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is at least 90% identical to amino acids 1-819 of SEQ ID NO:4; (ii) a 
polypeptide comprising a polypeptide sequence that is at least 90% identical 
to amino acids 1-460 of SEQ ID NO:6; (iii) a fragment of the polypeptide of 
(i) that includes at least one of histidine triad residue or coiled-coil region; (iv) 
5 a fragment of the polypeptide of (ii) that includes at least one of a histidine 
triad residue or a coiled-coil region; (v) a fragment of a polypeptide that is at 
least 90% identical to the polypeptide sequence comprising amino acids 1- 
800 of SEQ ID NO:8, wherein said fragment includes at least one of a 
histidine triad residue or coiled-coil region wherein said fragment includes at 

10 least 80 amino acids and no more than 680 amino acids and (vi) a fragment 
of a polypeptide that is at least 90% identical to the polypeptide sequence 
comprising amino acids 1-800 of SEQ ID NO: 10, wherein said fragment 
includes at least one of a histidine triad residue or coiled-coil region wherein 
said fragment includes at least 80 amino acids and no more than 680 amino 

15 acids. 
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Figure 1 
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Figure 2 
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Figure 6 (a) 
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Figure 6 (b) 
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Figure 6(c) 
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Figure 7 (a) 
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Figure 7 (b) 



CCTTAAGOATOCAOt CATOCOGATAATGTTCOOACAAAAGAAGAGATTA Majority 

* * 1 I t 
360 370 380 390 400 

408 CCTTAAGGATGCTGCCCACGCGGATAACGTCCCTACAAAAGAGGAAATCA phtA.SEQ 

351 CCTTAAGGATGCRGCTCATGCOOATAATATTCGGACAAAAGAAGAGATTA phC8.s«9 

351 CCTTAAGCATGCAGCTCATGCCCATAATATTCCCACAAAAGAAGACATTA phtO.SEQ 

411 CCTGAAAGATGCAGCTCATOCTOATAATGTTCCAACTAAAGATOAAATCA phtE.SEQ 

ATCOTCAGAAGCAOOAACATAGTCATAATCATOAGOOTOOAXCT---A-- Majority 
1 i * i * * 

410 420 430 '440 450 

455 XTCGACAAAAACAAOAOCATAOTCAACATCOTOAAGGTGGAACTCCAAGA phCA.SEQ 

401 AACCTCAOAAaCAOOAACOCAOTCATAATCATAACTCAAOACCA phtfi.seq 

401 AACGTCAOAAGCAGOAACACAGTCATAATCACOGOOOTGOTTCT------ phtO.SEQ 

461 ATCGTCAAAAACAAGAACATOTCAAAOATAATGAO--. -* A A G phtE.SEQ 

GATGATXXTG CTQTTOCTOTAOCCAQATCCCAAOOACGCTATACAACGOA Majority 
I I I I I 

460 470 480 490 500 

' i t 

508 AACGATGOTGCTOTTaCCTTaaCACOTTCOCAAOGACGCTATACTACAGA phtA.SCQ 

44S OAT A AT OCTGTTQCfOCAQCCAQAQCCCAAOOACOTTATACAACQGA pht8.*«q 

445 AACGATCAAGCAOTAOTTOCAOCCAaAOCCCAAOOACOCTATACAACCGA phtO.SEQ 
499 OTTAACTCTAATOTTOCTaTAOCAAOOTCTCACOQACOATATACOACAAA phtE.SEQ 

TGATOOTTATATCTTTAATCCATCTOATATCATTOAOOATACOCOTOATO Majority 

510 520 530 540 550 

i i i i 

555 TCATCCTTATATCTTTAATOCTTCTOATATCATAOAOOATACTQCTOATO phtA.SEO 
491 TOATGOGTATATCTTCAATOCATCTOATATCATTOAOOACACOCCTOATO phtB.sa? 
495 TGATGOTTATATCTTCAATQCATCTOATAT CATTOAGGACACOGGTOATG phtO.SEQ 
549 TOATOCTTATOTCTTTAATCCAOCTOATATTATCOAAOATACOGOTAATO phtE.SEQ 

CTTATATCOTTCCTCATOOCOATCATTACCATTACATTCCTAAQAATOAO Majorit* 

560 570 550 590 600 

■ * i i » 

608 CTTATATCOTTCCTCATOOAOATCATTACCATTACATTCCTAAGAATGAG phtA.SEO 

543 CTTATATCGTTCCTCACGOCOACCATTACCATTACATTCCTAAOAATGAG phtB.aao; 

S45 CTTATATCGTTCCTCACOOCCACCATTACCATTACATTCCTAAGAATOAG phtO.SEQ 

599 CTTATATCOTTCCTCATOOAOOTCACTATCACTACATTCCCAAAAGCOAT phtE.SEQ 

T T ATCAQCTAOCQAQTTAQCTOCTQCAQAAQCC T ATTTQQ AT GOG A Majority 

610 630 630 640 650 

' * ' ■ 1 

658 TTATCAOCTAOCOAOTTOOCTQCTOCAOAAOCCTTCCTATCTOGTCGAGG phtA.SEO 

593 TTATCAOCTAOCaAOTTAGCTQCTOCAGAAOCC TATTOO AAT GOOA phtB.aao 

S95 TTATCAOCTAOCOAOTTAaCTGCTGCAOAAOCC - - - • TATTOO AATCCOA phtO.SEQ 

649 TTATCTOCTAOTOAATTAOCAOCAOCTAAAOCAC ATCTOOCTOOAA phtE.SEO 

AO-- C A AAT - - OOP ATCTCQTCCTTCTT C A A O T T C T AOTTATACTT Majority 

I I \ \ 

660 670 680 690 700 

* ' ' ' 1 

708 AAATCTGTCAAATTCAAOAACCTATCOCCOACAAAATAOCGATAACACTT phtA.SEO 

638 AO---- - - CA OOOATCTCOTCCTTCTTCAAOTTCTAOTTATAATG phtB.i«9 

641 AG- CA -OOOATCTCOTCCTTCTTCAAOTTCTACTTATAATG phtO.SEQ 

695 A- * A AAT ATOCAACCO AOTC ••••AGTTA-ACCTATTCTT phtE.SEQ 



10/17 



SUBSTITUTE SHEET (RULE 26) 



WO 00/37105 



PCT/US99/30390 



Figure 7 (c) 
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i I I I 1 

960 970 980 990 1000 

' ' 1 

1008 TTACTCTCAAATOTCTOAATTOOAAOAACOAATCaCTCGTATTATTCCCC phtA.SEQ 

906 TTATOAACAAATOTCTOAATTOOAAAAACOAATTOCTCOTATTATTCCCC phtB.«»q 

909 TTATGAACAAATOTCTOAATTGOAAAAACOAATTOCTCQTATTATTCCCC phtD.SEQ 
951 TTACAOCAAOCTTTCTOCCTTAOAAOAAAAOATTOCCAOAAT phtE.SEQ 

TTCOTTATCOTTCAAAC CAT TOO GTACCAQATTCAAOACCAQAAGAACCA Majority 
I I I I ' 

1010 1020 1030 1040 1050 

' L- ^ 

10SB TTCGTTATCGTTCAAACCATTOOOTACCAOATTCAAOOCCAOAACAACCA phtA.SEO 

956 TTCOTTATCCTTCAAACCATTOOOTACCAOATTCAAOACCAOAAGAACCA phtB.MO 

959 TTCOTTATCGTTCAAACCATTOGGTACCAOATTCAAOACCAGAACAACCA phtD.SEO 
993 - - GOTCCC- T- ATCAOTCGAACTG phtE.SEO 
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Figure 7 (d) 



AOTCCACAATCOACTCCGOAACCTAOTCCAAGTCCOCAACCTOCACCAAA Majority 
i » * i i 
1060 1070 1080 1090 1100 
1 ' .1 

HOB AGTCCACAACCGACTCCGOAACCTACTCCAOGCCCCCAACCTGCACCAAA phtA.SEQ 

1006 ACTCCACAACCOACTCCAGAACCTAGTCCAAGTCCGCAACC-- phtB.seq 

1009 ACTCCACAATCGACTCCGGAACCTAGTCCAAGTCCGCAACCTGCACCAAA phCD.SEQ 

1013 GTTCTACAOTT TCTA C A A A -TGCA--AAA phtE.SEQ 

' i 

TC-T-AA AGCT C CAAOCAATCCAATTOATO-OAAATTCOTCA A A G A A G Majority 

I I 1 I I 

1110 1120 1130 1140 1150 

1 

USB TCTTAAAATAGACTCAA---ATTCTTCT-- ----TTGGTTAGTCAGC phtA.SEQ 

1047 * AGCTCCAAGCAATCCAATTOATOGOAAATTOGTCAAAGAAG phtB.seq 

1059 TCCTCAACCAOCTCCAAGCAATCCAATTOATOAOAAATTOGTCAAAGAAG phtO.SEQ 
1039 CC- --TAATO • phtE.SEQ 

CTOTTCGAAAAOTAOOCOATQOTTATOTCTTTQAOOAOAATOOAGTTTCT Majority 

1160 1170 1180 1190 1200 
' ' ' 1 
1196 TGGTACGAAAAGTTOGGGAAOGATATGTATTCGAAGAAAAGOGCATCTCT phtA.SEQ 
10B8 CTOTTCOAAAAGTAOaCOATGOTTATOTCTTTOAOOAGAATOOAOTTTCT phtB.seq 
1109 CTOTTCOAAAAOTAGOC0ATG0TTAT0TCTTTOAOGAOAATGOAGTTTCT phtO.SEQ 
1046 AAGTAO tOTCT AOTCT- - phtE.SEQ 

CCTTATATCCCAOCCAAOOATCTTTCAOCAOAA ACAOCAOCAOOCATTOA Majority 
I I I I I 

1210 1220 1230 1240 1250 

' ' 1 1 

1246 CGTTATGTCTTTOC0AAAOATTTACCATCT0AAACTGTTAAAAATCTTOA phtA.SEQ 

1139 COTTATATCCCAOCCAAGOATCTTTCAaCAOAAACAGCAGCAOGCATTGA phtB.seq 
US 9 CGTTATATCCCAGCCAAOOATCTTTCAOCAOAAACAOCAOCAOOCATTOA phtO.SEQ 
X0(2 - AOGC phtE.SEQ 

TAGCAAACTO OCCAAOCAOOAAAOTTTTTCTCATAAOCTAOOAOCTAAOA Majority 
I » i . . 

1360 1270 1280 - 1290 1300 

' 1 * »- 

1396 AAGCAAGTTATCAAAACAAOAOAOTOTTTCACACACTTTAACTOCTAAAA phtA.SEQ 

1188 TAGCAAACTGCCCAAOCAOOAAAOTTTATCTCATAAOCTAOOAACTAAGA phtB.seq 
1209 TACCAAACTOGCCAAOCAOOAAAOTTTATCTCATAAOCTAOOAOCTAAOA phtO.SEQ 
1066 . AQTCTTTC phtE.SEQ 

AAACTCA-TCTTC CTTCTAOTOA T C GAP A A T T T T A COATAAGOCTTATGAC Majority 
I I I 1 I 

1310 1320 1330 1340 1350 

' T t I ■, I 

1346 AAOAAAATOTTGCTCCTCGTOACCAAOAATTTTATOATAAAOCATATAAT phtA.SEQ 

1233 AAACTOACCTCCCATCTAOTOATCOAGAATTTTACAATAAOOCTTATOAC phtB.seq 
1259 AAACTOACCTCCCATCTAOTOATCOAOAATTTTACAATAAGGCtTATGAC phtO.SEQ 
1074 .AXOCXATCCTTCTTCT TTXXCOXCXXO phtE.SEQ 

TTACTAOCA A QAATTCACCAAQATTTACTTOATAATAAOOOTCQACAAOT Majority 
1 | 1 1 

1360 1370 1380 1390 1400 

trill 

1396 CTGTTAACTGAOOCTCATAAAOCCTTGTTTGNAAATAAOOGTCOTAATTC phtA.SEQ 

1288 TTACTAGCAAOAATTCACCAAOATTTACTTOATAATAAAOGTCOACAAOT phtB.seq 

1309 TTACTAGCAAOAATTCACCAAGATTTACTTGATAATAAAOaTCOACAAGT phtO.SEQ 

H01 . • .-TAAOQA--- phtE.SEQ 
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Figure 7(e) 



TGATTTTOAOOCTT** CATAACCTOTTOOAACCACTCAAGGATOTCTCAA K* jocity 

1410 1420 1430 K40 MSO 

1 ■ * U 

144& TGATTTCCAAOCCTTAGACAAATTATTAGAACOCTTCAATGATGAATCGA phtA.SEO 
1338 TOATTTTOAOOCTTTOOATAACCTaTTOOAACOACTCAAQOATCTCTCAA phtB.««q 
1359 TCATTTTQAOaCTTTCaXTAACCTOTTCOAACGACTCAAOOATCTCCCAA phCD.SEC 
1107 phtE.SEQ 

G T OA TAAAOTCAAOTTAOTOOATOATATTCTTOCCTTCTTAOCTCCOATT Majority 
♦ till 

1440 1470 1410 1490 1500 

I I I I L. 

1496 CTAATAAAGAAAAATTOGTAOATQATTTATTOGCATTCCTACCACCAATT phtA.SEQ 
1388 CTGATAAAGTCAAGTTAGTCGAAOATATTCTTOCCTTCTTAGCTCCGATT phta.Jaq 
1409 CTGATAAAaTCAAOTTACTOCATOATATTCTTOCCTTCTTAGCTCCGATT phtD.SEQ 
1107 OCTCTCTT phtE.SEQ 

COTCATCCAOAACOTTTAOOAAAACCAAATOCOCAAATTACCTACACTOA Majority 
« 1 > """""" » » I 
1S10 1530 1S30 1540 1550 
I t I | i 

1546 ACCCATCCAGAGCOACTTCOCAAACCAAATTCTCAAATTGACTATACTGA phtA.SEC 

1438 CGTCATCCAOAACOTTTAGOAAAACCAAATaCGCAAATTACCTACACTGA phtB.««q 

1459 COTCATCCAOAACOTTTAOOAAAACCAAATaCOCAAATTACCTACACTOA phtO.SEQ 

1115 . - . - . phtE.SEC 

TO AT O AO ATTCAAOTAOCCAAOTTOOCAOO CAAOTACACAOCATCAGATO Majority 

i i i i r 

1560 1570 1580 1590 1600 

' * » ' ' 

1596 AOACOAAOTTCOTATTOCTCAATTAOCTOATAAGTATACAACGTCAOATG phtA.SEO 
I486 TOATCAOATTCAAOTAOCCAAOTTOOCAOOCAAOTACACAOCAOAAGACO phtB.sao 
1509 TOATOAOATTCAXaTAOCCAAOTTOGCAOOCAAaTACACAACAOAACACG phtO.SEQ 
1115 - - CAOCATCTGATG phtE.SEO 

OT TATATTTTTOATCCTCOTQATATAACCAOTOATCAOCOOOATOCCTA T Majority 

\ i i i : r 

16X0 1630 1630 1640 1650 

lift u 

1646 GTTACATTTTTOATOAACATOATATAATCACTOATOAAGOAOATOCATAT phtA.SEp 
1538 CTTATATCTTTQATCCTCOTOATATAACCAQTOATOAOGOOOATOCCTAT phtB.aao 
1559 CTTATATCTTTOATCCTCOTOATATAACCAOTOATOAOOOOOATOCCTAT phtO.SEp 
1137 OTTATATTTTTAAfCC - - - phtS.SEp 

OTAACTCCACATAtOACCCAT AO CCACTOOAT TAAAAAAO A T A O T T T C T C Majority 

1 114 11 9 

1660 1670 1680 1690 1700 

lilt U 

1696 GTAACOCCTCATATOOOCCATAOTCACTOOATTOOAAAAOATAOCCTTTC phtA.SS? 
1588 GTAACTCCACATATOACCCATAGCCACTOOATTAAAAAAGATAGTTTGTC pbtB.saO 
1609 CTAACTCCACATATSACCCATAQCCACTOOATTAAAAAAQATAGTTTGTC phtO.SEfl 
1143 m AAAAOATA TC phtE.SEQ 

TOAAOCTGAOAOAOCOOCAOCCCAOOCT TATOCTAAAOAOAA AOOTT T G A Ha jorlt/ 

I I r 

1710 1730 1730 1740 1750 

1746 TGATAAGQAAAAAOTTOCAOCTCAAOCCTATACTAAAOAAAAAGGTATCC phtA.SEp 

1638 TGAAGCTGAOAOAOCOOCAaCCCAOGCTTATOCTRAAOAGAAAOGTTTGA pht&.stO 

1659 TGAAOCTOAOAOAQCOOCAOCCCAQOCTTATOCTAAAOAQAAAQOTTTCA phtO.SEQ 

1153 OTTOAAQAAACOOC . - phtE.SEQ 
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Figure 7(f) 



C C C C T C CTTCOACAO J C A T C A O O A T T C A G G X A A TACTGAGGCAAAAGGA Majority 

i t * I \ 

1760 1770 1780 1790 1800 

1 

1796 TACCTCCATCTCCAGACGCAGATGTTAAAGCAAATCCAACTGGAOATACT phtA.SEQ 
1688 CCCCTCCTTCCACAOACCATCAOGATTCAGGAAATACTGAGGCAAAAGOA phtB.aeq 
1709 CCCCTCCTTCGACAOACCATCACOATTCAGGAAATACTOAGGCAAAAGGA phtO.SEQ 
1167 - . phtE.SEQ 

O C AGAAOCTATCTACAACCOXGTQAAAOCAQCTAAOA A G G T G C C A C'T T C A Majority 
I I I I I 

1810 1820 1830 U40 18S0 

■ t i . ■ 

1846 OCAGCAOCTATTTACAATCOTOTOAAAOOOOAAAAACGAATTCCACTCGT phtA.SEQ 

.1738 OCAOAAOCTATCTACAACCOHGTOAAAOCAGCTAAOAACGTOCCACTTGA phtB.saq 

17S9 OCAOAAGCTATCTACAACCaCOTaAAAOCAOCTAAOAACOTGCCACTTCA phtO.SEQ 

1167 TXCAOCT phtE.SEQ 

TCGTATQCCTTACAATCTTCAATATACTOTAQAAOT C A A A A A C G OTAGTT Majority 

I I I . I I 

1860 1870 1880 1890 1900 

i i i t t_ 

1895 TCGACTTCCATATATOOTTOAOCATACAOTTGAOOTTAAAAACGGTAATT phtA.SEQ 

1788 TCGTATOCCTTACAATCTTCAATATACTOTAOAAGTCAAAAACaOTACTT phtB.aaq 

1809 TCOTATOCCTTACAATCTTCAATATACTOTAOAAGTCAAAAACGaTAGTT phtO.SEQ 

1174 TATATTGTAAOA phtE.SEQ 

T A A T C ATACCTCATTATOATCATTACCATAACATT A A'ATT T OAOTQOTTT Majority 
I I I I I 

1910 1920 1930 1940 19S0 
1 1 1 ' 1 
1946 TOATTATTCCTCATAAGOATCATTACCATAATATTAAATTTGCTTOGTTT phtA.SEQ 
1838 TAATCATACCTCATTATOACCATTACCATAACATCAAATTTGAGTOGTTX phtB.aaq 
18S9 TAATCATACCTCATTATOACCATTACCATAACATCAAATTTOAOTGOTTT phtO.SEQ 
1186 CATOOTOATCATTTCCATTACATT phtE.SEQ 

OACOAAOOCCTTTATOAOOCACCTAAOOOOTATACTCTTOACQATCTTTT Majority 
• 1 i i I i 

1960 1970 1980 1990 3000 

1 ' * 

1996 OATOATCACACATACAAAOCTCCAAATOGCTATACCTTOOAAOATTTCTT phtA.SEQ 

1888 OACGAAOOCCTTTATOAOOCACCTAAOOOGTATACTCTTOAOGATCTTTT pht&.aa* 
1909 GACOAAOOCCTTTATOAOOCACCTAAOOGOTATACTCTTOAOOATCTTTT phtO.SEQ 
1210 CC11X - - phtE.SEQ 

GOCQACTOTCAAOTACTATQTCOAACATCCAOACOAACOTCCQCATTCAO Majority 

i I 1 

2010 2020 2030 3Q40 2050 

1 1 • ' ' ' * 

3046 TGCGACGATTAAGTACTACaTAQAACACCCTOACOAACGTCCACATTCTA phtA.SEQ 

1938 aOCOACTOTCAAGTACTATOTCOAACATCCAAACGAACOTCCOCATTCAO phtfi.aaq^ 
1959 OOCOACTOTCAAaTACTATOTCGAACATCCAAACOAACOTCCOCATTCAC phtO.SEQ 
1215 ATCAAAT--CAAATTOOOCAACC-GAC- -TCT TCCAA phtE.SEQ 

ATAATOOTTTTOOTAACOCTAOCOACCATOT T TTXA-AAACA A OAAAGAT Majority 
» ■ 1 i 
2060 2070 2080 2090 2100 . 
1 1 ' ' ' 
2096 ATGATOOATGOGGCAATQCCAGTaAQCATOTGTT*--AGGCAAGAAAGAC phtA.SEQ 
1988 ATAATaaTTTTaGTAACOCTAOCOACCATOTTCAAAaAAACAAAAATaOT phtB.sta, 
2009 ATAATGGTTTTGOTAACOCTAOCOACCATaTTCOTAAAAATAAGOTAOAC phtO.SEQ 
1247 ACAATAO-TCTAGCAACACCTTCT-CCATCTCTTC-----CAA-- T phtE.SEQ 
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Figure 7(g) 



C-AAOCCAOTAAACCTAATOAAOATC AGAAACATOACCAACTAAG-GAO-- K* jortty 

22 2120 22 22 2150 

2" CA CAGTCAACAT 1 CCAAATAAG - phtA.SEQ 

2038 CAAGCTOATA CCAATCAA- ACGGAAAA ACCAAGCGAG-GAGAA 

2059 CAAGACAGTAAACCTGATOAAGATAAOOAACATOATGAAGTAAC 
1284 CAATCCAG-GAACTTCAC ATGAGAAACATGA 



phCB.scq 
T G A G C C phtO.SEQ 
phCE.SEQ 



A-CTCA--C-OAA TOAAOAAOA -AACCACO--0-TTTAAATCCT* Majority 

22 '"° 2180 2X90 2200 

* • AAC- -TTCAAA DhtA.SEO 

2079 ACCTCAGACAOAAAAACCTGAOOAAGA-AACC CCTC BhtB 

2X09 AACTCACCCTGAATCTOATGAAAAAGAGAATCACGCTGGTTTAAATCCTT phtolsS 
13X4 •-•--------AGAAGATGOATACO--GATTTGA-TCCT - pheC.SEQ 

-AOCAOATAAACCOTATAAOCCAO- -AC- A-AC*-A-»»A Majority 

2210 22 22 2240 2250 

2173 * - GCOOATOAA OAOCCAO - i 'J ' htA 

2XX4 CSACAAOAOAAACCOCA-AAOCOAOAAACCAOAOTCTCCAAAACCAACAOA pbta!.« 
2X59 CAGCAG A T A A T C T T T A T A A A C C A A O C A C T O A T A C 0 O AAGAOACAO AOOAA phtO.SEQ 
1339 AATCOTATTATC phtX.SSQ 

Q-AQCTOQAOQAAXCACCAOATQAQTCAOAAOTXC C T C A 



2260 2270 2210 



_ --AOTAQAOACTO Majority 
• t | 



i 2290 2300 

2189 TXOACOAXXCACCTOCTOAOCCXOAAflTCCCTCAAOTXCXCACTC phtA.SZQ 

2X63 CCAACCAOAACAATCACCAGAOOAATCAOAAOAACCTCAOCTCOAOACTO phtB...q 

2209 OAAGCIQAACATACCAC-AOATOAOOCTOAAATTCCTCAAOIAOAOAATT phtD.SSO 

X35X G C T G A A G A T O A A T C phtX.SEQ 

AAAAXGTTOAAOCXAAACTXAXAO AXQCXOAOOTTTTOCTTQXAAAAOTC Majority 
22 3320 2"0 2340 2350 

2234 **^*OTAOAAflCCCAACTCAAAOAAOCACAAOTTTTOCTTOCOAAACTA phtA.SZQ 
22X3 AAAAGGTTGAAGAAAAACTGAGAOAGOCTOAAGATTTACTTGGAAAAATC phtB.a.q 
225B CTCTTATTAACOCTAAOAtAOCAOATOCOOAOOCCTTOCTAOAAAAAOTX phtO.SEQ 
AOOTTTTO TC pbCX.SXQ 

ACGOATCCTAOTATXAAAXCCA ATQCXACOOAOACTCTXACTQQTTTAAA Majority 
23 . 60 23 ?° 2310 2390 2400 

ACGOATTCTAOTCTOAAAOCCAATOCAACAOAAACTCTAOCTOOTTTACG phtA.SEQ 
2263 CAOGATCCAATTATCAAOTCCAATOCCAAAOAOACTCTCACAOOATTAAA phtB..*, 
2308 ACACATCCTAOTAtTAOACAAAATOCTATOOAOAC A T T O A C T O G T C T A A A phtO.StQ 

A T G A G T C - ACGOAOACC phtC.SEC 

AAATAATTTXCTTCTTOOAA CXAAQOATAATAATACTATTTTOGCAQAAC Majority 

22 24 ?° 2430 2440 2«0 

1 1 I ■ I 



TGQCAGAAG phtA.SEO 



2334 AAATAATTTO ACTCTTCAAATTATOQATAACAATAOTAICA . . . 

^ XnATTIAC ^"TOOCACCCAOaACAACAATACTATTATOOCAOAAO phtB.aaq 
2358 AAOT AOTCTTCTTCTCGOAACOAAAOATAATAACACTATTTCAOC AG AAO phtD.StO 

UAAT--CATTATTTCTTCA phtX.STG 
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Figure 7 (h) 



CAOAAAGACTATTOOCTTTOTTAAAOOAQAOTAAXT - A A G G T CTT Majority 

2460 2470 2480 2490 2500 

1 1 1 ' 

2384 CAGAAAAATTACTTOCGTTOTTAAAAGGAAOTAATC---- -CTT phtA.SEC 

2363 CTGAAAAACTATTGOCTTTATTAAAOOAOAOTAAGTAAAOOTAOAAGCTT phtB.*«q 

2408 TAGATAOTCTCTTGOCTTTOTTAAAAGAAAGT- - * - - - - c phtD.SEQ 

1409 - AGAAGGACT--TOAC----- AOAAGAGCAAATTAAGGT phtE.SEQ 

* * K 

AA--OCO--TCTOOC-CCTA-0-CAA-AA-A-T--TATGOXAAAAGCTXA MAjorlty 
• * * i I 

2S10 2520 2530 2540 2550 

1 ' 1 ' 
2433 CA TCTO T A A O - - -TAAGGAAAAAAT - • • phCA.SCQ 

2413 AAGGOCOAATTTGGCACCCAOGACAACAATACTATTATGGCAOAAGCTOA phtB.svq 

2441 AA CCOOCTCCTA ---TATAGTAAAACCTTA phtD.SEQ 

1440 OCO- CAAAAACATT TAG phtZ.SEQ 

AAAACTAXX Majority 



2445 -AAACTAA 
2463 AAAACTATT 

2468 AO CC 

1455 



phtA.SEQ 
phtB.swj 
phtD.SEQ 
phCX.SXQ 
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Figure 8 

Strain SJ2 (serotype 6B) 
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Strain EF5668 (serotype 4) 
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SEQUENCE LISTING 

<110> Johnson, Leslie S. 
Koenig, Scott 
Adamou , John E. 

<120> Streptococcus Pneumoniae and Immunogenic Fragments for 
Vaccines 

<130> 469201-444 

<140> 
<141> 

<150> 60/113,048 
<151> 1998-12-21 

<160> 11 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Forward primer 
used in amplification of the Sp3 6 gene sequence. 

<400> 1 

atcggatcct tcttacgagt tgggactgta tcaagc 36 

<210> 2 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Forward primer 
used in amplification of the Sp36 gene sequence. 

<400> 2 

atcggatcca ctgtatcaag ctagaacggt taagg 35 

<210> 3 
<211> 40 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Reverse primer 
used in amplification of the Sp3 6 gene sequence. 

<400> 3 

agtcaagctt gtttattttt tccttactta cagatgaagg 40 

<210> 4 
<211> 838 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 4 

Met Lys He Asn Lys Lys Tyr Leu Ala Gly Ser Val Ala Val Leu Ala 
15 io 15 

Leu Ser Val Cys Ser Tyr Glu Leu Gly Arg His Gin Ala Gly Gin Val 
20 25 30 

Lys Lys Glu Ser Asn Arg Val Ser Tyr He Asp Gly Asp Gin Ala Gly 
35 40 45 

Gin Lys Ala Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly 
50 55 60 

He Asn Ala Glu Gin He Val He Lys He Thr Asp Gin Gly Tyr Val 
65 70 75 80 

Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr 
85 90 95 

Asp Ala He He Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gin 
100 105 110 

Leu Lys Asp Ser Asp He Val Asn Glu He Lys Gly Gly Tyr Val He 
115 120 125 

Lys Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala 
130 135 140 

Asp Asn He Arg Thr Lys Glu Glu He Lys Arg Gin Lys Gin Glu His 
145 150 155 160 

Ser His Asn His Gly Gly Gly Ser Asn Asp Gin Ala Val Val Ala Ala 
165 170 175 
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Arg Ala Gin Gly Arg Tyr Thr Thr Asp Asp Gly Tyr lie Phe Asn Ala 
180 185 190 

Ser Asp lie lie Glu Asp Thr Gly Asp Ala Tyr He Val Pro His Gly 
195 200 205 

Asp His Tyr His Tyr He Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu 
210 215 220 

Ala Ala Ala Glu Ala Tyr Trp Asn Gly Lys Gin Gly Ser Arg Pro Ser 
225 230 235 240 

Ser Ser Ser Ser Tyr Asn Ala Asn Pro Ala Gin Pro Arg Leu Ser Glu 
245 250 255 

Asn His Asn Leu Thr Val Thr Pro Thr Tyr His Gin Asn Gin Gly Glu 
260 265 270 

Asn He Ser Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu 
275 280 285 

Arg His Val Glu Ser Asp Gly Leu He Phe Asp Pro Ala Gin He Thr 
290 295 300 

Ser Arg Thr Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His 
305 310 315 320 

Phe He Pro Tyr Glu Gin Met Ser Glu Leu Glu Lys Arg He Ala Arg 
325 330 335 

He He Pro Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg 
340 345 350 

Pro Glu Gin Pro Ser Pro Gin Ser Thr Pro Glu Pro Ser Pro Ser Pro 
355 360 365 

Gin Pro Ala Pro Asn Pro Gin Pro Ala Pro Ser Asn Pro He Asp Glu 
370 375 380 

Lys Leu Val Lys Glu Ala Val Arg Lys Val Gly Asp Gly Tyr Val Phe 
385 390 395 400 

Glu Glu Asn Gly Val Ser Arg Tyr He Pro Ala Lys Asp Leu Ser Ala 
405 410 415 

Glu Thr Ala Ala Gly He Asp Ser Lys Leu Ala Lys Gin Glu Ser Leu 
42 <> 425 430 
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Ser His Lys Leu Gly Ala Lys Lys Thr Asp Leu Pro Ser Ser Asp Arg 
435 440 445 

Glu Phe Tyr Asn Lys Ala Tyr Asp Leu Leu Ala Arg lie His Gin Asp 
450 455 460 

Leu Leu Asp Asn Lys Gly Arg Gin Val Asp Phe Glu Ala Leu Asp Asn 
465 470 475 480 

Leu Leu Glu Arg Leu Lys Asp Val Pro Ser Asp Lys Val Lys Leu Val 
485 490 495 

Asp Asp He Leu Ala Phe Leu Ala Pro He Arg His Pro Glu Arg Leu 
500 505 510 

Gly Lys Pro Asn Ala Gin He Thr Tyr Thr Asp Asp Glu He Gin Val 
515 520 525 

Ala Lys Leu Ala Gly Lys Tyr Thr Thr Glu Asp Gly Tyr He Phe Asp 
530 535 540 

Pro Arg Asp He Thr Ser Asp Glu Gly Asp Ala Tyr Val Thr Pro His 
545 550 555 560 

Met Thr His Ser His Trp He Lys Lys Asp Ser Leu Ser Glu Ala Glu 
565 570 575 

Arg Ala Ala Ala Gin Ala Tyr Ala Lys Glu Lys Gly Leu Thr Pro Pro 
580 585 590 

Ser Thr Asp His Gin Asp Ser Gly Asn Thr Glu Ala Lys Gly Ala Glu 
595 600 605 

Ala He Tyr Asn Arg Val Lys Ala Ala Lys Lys Val Pro Leu Asp Arg 
610 615 620 

Met Pro Tyr Asn Leu Gin Tyr Thr Val Glu Val Lys Asn Gly Ser Leu 
625 630 635 640 

He He Pro His Tyr Asp His Tyr His Asn He Lys Phe Glu Trp Phe 
645 650 655 

Asp Glu Gly Leu Tyr Glu Ala Pro Lys Gly Tyr Thr Leu Glu Asp Leu 
660 665 670 

Leu Ala Thr Val Lys Tyr Tyr Val Glu His Pro Asn Glu Arg Pro His 
675 680 685 
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Ser Asp Asn Gly Phe Gly Asn Ala Ser Asp His Val Arg Lys Asn Lys 
690 695 700 

Val Asp Gin Asp Ser Lys Pro Asp Glu Asp Lys Glu His Asp Glu Val 
705 710 715 720 

Ser Glu Pro Thr His Pro Glu Ser Asp Glu Lys Glu Asn His Ala Gly 
725 730 735 

Leu Asn Pro Ser Ala Asp Asn Leu Tyr Lys Pro Ser Thr Asp Thr Glu 
740 745 750 

Glu Thr Glu Glu Glu Ala Glu Asp Thr Thr Asp Glu Ala Glu lie Pro 
755 760 765 

Gin Val Glu Asn Ser Val lie Asn Ala Lys He Ala Asp Ala Glu Ala 
770 775 780 

Leu Leu Glu Lys Val Thr Asp Pro Ser He Arg Gin Asn Ala Met Glu 
785 790 795 800 

Thr Leu Thr Gly Leu Lys Ser Ser Leu Leu Leu Gly Thr Lys Asp Asn 
805 810 815 

Asn Thr He Ser Ala Glu Val Asp Ser Leu Leu Ala Leu Leu Lys Glu 
820 825 830 

Ser Gin Pro Ala Pro He 
835 



<210> 5 
<211> 2531 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 5 

atgaaaatta ataaaaaata tctagcaggt 
tcctatgaac ttggtcgtca ccaagctggt 
tatatagatg gtgatcaggc tggtcaaaag 
aagagggagg ggatcaacgc cgaacaaatc 
acctctcatg gagaccatta tcattactat 
agtgaagagc tcctcatgaa agatccgaat 
gaaatcaagg gtggttatgt tatcaaggta 
gcagctcatg cggataatat tcggacaaaa 
agtcataatc acgggggtgg ttctaacgat 
cgctatacaa cggatgatgg ttatatcttc 
gatgcttata tcgttcctca cggcgaccat 



tcagtggcag tccttgccct aagtgtttgt 60 
caggttaaga aagagtctaa tcgagtttct 120 
gcagaaaact tgacaccaga tgaagtcagt 180 
gtcatcaaga ttacggatca aggttatgtg 240 
aatggcaagg tcccttatga tgccatcatc 300 
tatcagttga aggattcaga cattgtcaat 360 
gatggaaaat actatgttta ccttaaggat 420 
gaagagatta aacgtcagaa gcaggaacac 480 
caagcagtag ttgcagccag agcccaagga 540 
aatgcatctg atatcattga ggacacgggt 600 
taccattaca ttcctaagaa tgagttatca 660 
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gctagcgagt tagctgctgc agaagcctat tggaatggga agcagggatc tcgtccttct 720 
tcaagttcta gttataatgc aaatccagct caaccaagat tgtcagagaa ccacaatctg 780 
actgtcactc caacttatca tcaaaatcaa ggggaaaaca tttcaagcct tttacgtgaa 840 
ttgtatgcta aacccttatc agaacgccat gtggaatctg atggccttat tttcgaccca 900 
gcgcaaatca caagtcgaac cgccagaggt gtagctgtcc ctcatggtaa ccattaccac 960 
tttatccctt atgaacaaat gtctgaattg gaaaaacgaa ttgctcgtat tattcccctt 1020 
cgttatcgtt caaaccattg ggtaccagat tcaagaccag aacaaccaag tccacaatcg 1080 
actccggaac ctagtccaag tccgcaacct gcaccaaatc ctcaaccagc tccaagcaat 1140 
ccaattgatg agaaattggt caaagaagct gttcgaaaag taggcgatgg ttatgtcttt 1200 
gaggagaatg gagtttctcg ttatatccca gccaaggatc tttcagcaga aacagcagca 1260 
ggcattgata gcaaactggc caagcaggaa agtttatctc ataagctagg agctaagaaa 1320 
actgacctcc catctagtga tcgagaattt tacaataagg cttatgactt actagcaaga 1380 
attcaccaag atttacttga taataaaggt cgacaagttg attttgaggc tttggataac 1440 
ctgttggaac gactcaagga tgtcccaagt gataaagtca agttagtgga tgatattctt 1500 
gccttcttag ctccgattcg tcatccagaa cgtttaggaa aaccaaatgc gcaaattacc 1560 
tacactgatg atgagattca agtagccaag ttggcaggca agtacacaac agaagacggt 1620 
tatatctttg atcctcgtga tataaccagt gatgaggggg atgcctatgt aactccacat 1680 
atgacccata gccactggat taaaaaagat agtttgtctg aagctgagag agcggcagcc 174 0 
caggcttatg ctaaagagaa aggtttgacc cctccttcga cagaccatca ggattcagga 1800 
aatactgagg caaaaggagc agaagctatc tacaaccgcg tgaaagcagc taagaaggtg 1860 
ccacttgatc gtatgcctta caatcttcaa tatactgtag aagtcaaaaa cggtagttta 1920 
atcatacctc attatgacca ttaccataac atcaaatttg agtggtttga cgaaggcctt 1980 
tatgaggcac ctaaggggta tactcttgag gatcttttgg cgactgtcaa gtactatgtc 204 0 
gaacatccaa acgaacgtcc gcattcagat aatggttttg gtaacgctag cgaccatgtt 2100 
cgtaaaaata aggtagacca agacagtaaa cctgatgaag ataaggaaca tgatgaagta 2160 
agtgagccaa ctcaccctga atctgatgaa aaagagaatc acgctggttt aaatccttca 2220 
gcagataatc tttataaacc aagcactgat acggaagaga cagaggaaga agctgaagat 2280 
accacagatg aggctgaaat tcctcaagta gagaattctg ttattaacgc taagatagca 2340 
gatgcggagg ccttgctaga aaaagtaaca gatcctagta ttagacaaaa tgctatggag 2400 
acattgactg gtctaaaaag tagtcttctt ctcggaacga aagataataa cactatttca 2460 
gcagaagtag atagtctctt ggctttgtta aaagaaagtc aaccggctcc tatatagtaa 2520 
aagcttaagc c 2531 

<210> 6 
<211> 484 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 6 

Met Lys Phe Ser Lys Lys Tyr He Ala Ala Gly Ser Ala Val He Val 



1 5 io 



15 



Ser Leu Ser Leu Cys Ala Tyr Ala Leu Asn Gin His Arg Ser Gin Glu 
20 25 30 

Asn Lys Asp Asn Asn Arg Val Ser Tyr Val Asp Gly Ser Gin Ser Ser 
35 40 45 
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Gin Lys Ser Glu Asn Leu Thr Pro Asp Gin Val Ser Gin Lys Glu Gly 
50 55 60 

lie Gin Ala Glu Gin He Val He Lys He Thr Asp Gin Gly Tyr Val 
65 70 75 80 

Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr 
85 90 95 

Asp Ala Leu Phe Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gin 
100 105 110 

Leu Lys Asp Ala Asp He Val Asn Glu Val Lys Gly Gly Tyr He He 
115 120 125 

Lys Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala 
130 135 140 

Asp Asn Val Arg Thr Lys Asp Glu He Asn Arg Gin Lys Gin Glu His 
145 150 155 160 

Val Lys Asp Asn Glu Lys Val Asn Ser Asn Val Ala Val Ala Arg Ser 
165 170 175 

Gin Gly Arg Tyr Thr Thr Asn Asp Gly Tyr Val Phe Asn Pro Ala Asp 
180 185 190 

He He Glu Asp Thr Gly Asn Ala Tyr He Val Pro His Gly Gly His 
195 200 205 

Tyr His Tyr He Pro Lys Ser Asp Leu Ser Ala Ser Glu Leu Ala Ala 
2X0 215 220 

Ala Lys Ala His Leu Ala Gly Lys Asn Met Gin Pro Ser Gin Leu Ser 
225 230 235 240 

Tyr Ser Ser Thr Ala Ser Asp Asn Asn Thr Gin Ser Val Ala Lys Gly 
245 250 255 

Ser Thr Ser Lys Pro Ala Asn Lys Ser Glu Asn Leu Gin Ser Leu Leu 
260 265 270 

Lys Glu Leu Tyr Asp Ser Pro Ser Ala Gin Arg Tyr Ser Glu Ser Asp 
275 280 285 

Gly Leu Val Phe Asp Pro Ala Lys He He Ser Arg Thr Pro Asn Gly 
290 295 300 
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Val Ala He Pro His Gly Asp His Tyr His Phe He Pro Tyr Ser Lys 
305 310 315 320 

Leu Ser Ala Leu Glu Glu Lys He Ala Arg Met Val Pro He Ser Gly 
325 330 335 

Thr Gly Ser Thr Val Ser Thr Asn Ala Lys Pro Asn Glu Val Val Ser 
340 345 350 

Ser Leu Gly Ser Leu Ser Ser Asn Pro Ser Ser Leu Thr Thr Ser Lys 
355 360 365 

Glu Leu Ser Ser Ala Ser Asp Gly Tyr He Phe Asn Pro Lys Asp He 
370 375 380 

Val Glu Glu Thr Ala Thr Ala Tyr He Val Arg His Gly Asp His Phe 
385 390 395 400 

His Tyr He Pro Lys Ser Asn Gin He Gly Gin Pro Thr Leu Pro Asn 
405 410 

Asn Ser Leu Ala Thr Pro Ser Pro Ser Leu Pro He Asn Pro Gly Thr 
42 <> 425 430 

Ser His Glu Lys His Glu Glu Asp Gly Tyr Gly Phe Asp Ala Asn Arg 
435 440 445 

He He Ala Glu Asp Glu Ser Gly Phe Val Met Ser His Gly Asp His 
450 455 460 



Asn His Tyr Phe Phe Lys Lys Asp Leu Thr Glu Glu Gin He Lys Val 
465 

Arg Lys Asn He 



465 470 475 480 



<210> 7 
<211> 1455 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 7 

atgaaattta gtaaaaaata tatagcagct 
tgtgcctatg cactaaacca gcatcgttcg 
tatgtggatg gcagccagtc aagtcagaaa 
cagaaagaag gaattcaggc tgagcaaatt 
acgtcacacg gtgaccacta tcattactat 



ggatcagctg ttatcgtatc cttgagtcta 60 
caggaaaata aggacaataa tcgtgtctct 120 
agtgaaaact tgacaccaga ccaggttagc 180 
gtaatcaaaa ttacagatca gggctatgta 240 
aatgggaaag ttccttatga tgccctcttt 300 
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agtgaagaac tcttgatgaa ggatccaaac tatcaactta aagacgctga tattgtcaat 3 60 
gaagtcaagg gtggttatat catcaaggtc gatggaaaat attatgtcta cctgaaagat 420 
gcagctcatg ctgataatgt tcgaactaaa gatgaaatca atcgtcaaaa acaagaacat 480 
gtcaaagata atgagaaggt taactctaat gttgctgtag caaggtctca gggacgatat 540 
acgacaaatg atggttatgt ctttaatcca gctgatatta tcgaagatac gggtaatgct 600 
tatatcgttc ctcatggagg tcactatcac tacattccca aaagcgattt atctgctagt 660 
gaattagcag cagctaaagc acatctggct ggaaaaaata tgcaaccgag tcagttaagc 720 
tattcttcaa cagctagtga caataacacg caatctgtag caaaaggatc aactagcaag 780 
ccagcaaata aatctgaaaa tctccagagt cttttgaagg aactctatga ttcacctagc 840 
gcccaacgtt acagtgaatc agatggcctg gtctttgacc ctgctaagat tatcagtcgt 900 
acaccaaatg gagttgcgat tccgcatggc gaccattacc actttattcc ttacagcaag 960 
ctttctgcct tagaagaaaa gattgccaga atggtgccta tcagtggaac tggttctaca 1020 
gtttctacaa atgcaaaacc taatgaagta gtgtctagtc taggcagtct ttcaagcaat 1080 
ccttcttctt taacgacaag taaggagctc tcttcagcat ctgatggtta tatttttaat 1140 
ccaaaagata tcgttgaaga aacggctaca gcttatattg taagacatgg tgatcatttc 1200 
cattacattc caaaatcaaa tcaaattggg caaccgactc ttccaaacaa tagtctagca 1260 
acaccttctc catctcttcc aatcaatcca ggaacttcac atgagaaaca tgaagaagat 1320 
ggatacggat ttgatgctaa tcgtattatc gctgaagatg aatcaggttt tgtcatgagt 1380 
cacggagacc acaatcatta tttcttcaag aaggacttga cagaagagca aattaaggtg 1440 
cgcaaaaaca tttag 1455 

<210> 8 
<211> 819 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 8 

Met Lys lie Asn Lys Lys Tyr Leu Val Gly Ser Ala Ala Ala Leu He 
15 10 15 

Leu Ser Val Cys Ser Tyr Glu Leu Gly Leu Tyr Gin Ala Arg Thr Val 
20 25 30 

Lys Glu Asn Asn Arg Val Ser Tyr He Asp Gly Lys Gin Ala Thr Gin 
35 40 45 

Lys Thr Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly He 
50 55 60 

Asn Ala Glu Gin He Val He Lys He Thr Asp Gin Gly Tyr Val Thr 
65 70 75 80 

Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr Asp 
85 90 95 

Ala He He Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Lys Leu 
100 105 no 
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Lys Asp Glu Asp lie Val Asn Glu Val Lys Gly Gly Tyr Val lie Lys 
115 120 125 

Val Asp Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala Asp 
130 135 140 

Asn Val Arg Thr Lys Glu Glu He Asn Arg Gin Lys Gin Glu His Ser 
145 150 155 160 

Gin His Arg Glu Gly Gly Thr Pro Arg Asn Asp Gly Ala Val Ala Leu 
165 170 175 

Ala Arg Ser Gin Gly Arg Tyr Thr Thr Asp Asp Gly Tyr He Phe Asn 
180 185 190 

Ala Ser Asp He He Glu Asp Thr Gly Asp Ala Tyr He Val Pro His 
195 200 205 

Gly Asp His Tyr His Tyr He Pro Lys Asn Glu Leu Ser Ala Ser Glu 
210 215 220 

Leu Ala Ala Ala Glu Ala Phe Leu Ser Gly Arg Gly Asn Leu Ser Asn 
225 230 235 240 

Ser Arg Thr Tyr Arg Arg Gin Asn Ser Asp Asn Thr Ser Arg Thr Asn 
245 250 255 

Trp Val Pro Ser Val Ser Asn Pro Gly Thr Thr Asn Thr Asn Thr Ser 
260 265 270 

Asn Asn Ser Asn Thr Asn Ser Gin Ala Ser Gin Ser Asn Asp He Asp 
275 280 285 

Ser Leu Leu Lys Gin Leu Tyr Lys Leu Pro Leu Ser Gin Arg His Val 
290 295 300 



Glu Ser Asp Gly Leu Val Phe Asp Pro Ala Gin He Thr Ser Arg Thr 
305 310 315 



320 



Ala Arg Gly Val Ala Val Pro His Gly Asp His Tyr His Phe He Pro 
325 330 335 

Tyr Ser Gin Met Ser Glu Leu Glu Glu Arg He Ala Arg He He Pro 
34 ° 345 350 



Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro Glu Gin 
355 360 365 
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Pro Ser Pro Gin Pro Thr Pro Glu Pro Ser Pro Gly Pro Gin Pro Ala 
370 375 380 

Pro Asn Leu Lys lie Asp Ser Asn Ser Ser Leu Val Ser Gin Leu Val 
385 390 395 400 

Arg Lys Val Gly Glu Gly Tyr Val Phe Glu Glu Lys Gly He Ser Arg 
405 410 415 

Tyr Val Phe Ala Lys Asp Leu Pro Ser Glu Thr Val Lys Asn Leu Glu 
420 425 430 

Ser Lys Leu Ser Lys Gin Glu Ser Val Ser His Thr Leu Thr Ala Lys 
435 440 445 

Lys Glu Asn Val Ala Pro Arg Asp Gin Glu Phe Tyr Asp Lys Ala Tyr 
450 455 460 

Asn Leu Leu Thr Glu Ala His Lys Ala Leu Phe Glu Asn Lys Gly Arg 
465 470 475 480 

Asn Ser Asp Phe Gin Ala Leu Asp Lys Leu Leu Glu Arg Leu Asn Asp 
485 490 495 

Glu Ser Thr Asn Lys Glu Lys Leu Val Asp Asp Leu Leu Ala Phe Leu 
500 505 510 

Ala Pro He Thr His Pro Glu Arg Leu Gly Lys Pro Asn Ser Gin He 
515 520 525 

Glu Tyr Thr Glu Asp Glu Val Arg He Ala Gin Leu Ala Asp Lys Tyr 
530 535 540 

Thr Thr Ser Asp Gly Tyr He Phe Asp Glu His Asp He He Ser Asp 
545 550 555 560 

Glu Gly Asp Ala Tyr Val Thr Pro His Met Gly His Ser His Trp He 
565 570 575 

Gly Lys Asp Ser Leu Ser Asp Lys Glu Lys Val Ala Ala Gin Ala Tyr 
580 585 590 

Thr Lys Glu Lys Gly He Leu Pro Pro Ser Pro Asp Ala Asp Val Lys 
595 600 605 

Ala Asn Pro Thr Gly Asp Ser Ala Ala Ala He Tyr Asn Arg Val Lys 
61 <> 615 620 
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Gly Glu Lys Arg lie Pro Leu Val Arg Leu Pro Tyr Met Val Glu His 
625 630 635 640 

Thr Val Glu Val Lys Asn Gly Asn Leu He He Pro His Lys Asp His 
645 650 655 

Tyr His Asn He Lys Phe Ala Trp Phe Asp Asp His Thr Tyr Lys Ala 
660 665 670 

Pro Asn Gly Tyr Thr Leu Glu Asp Leu Phe Ala Thr He Lys Tyr Tyr 
675 680 685 

Val Glu His Pro Asp Glu Arg Pro His Ser Asn Asp Gly Trp Gly Asn 
690 695 700 

Ala Ser Glu His Val Leu Gly Lys Lys Asp His Ser Glu Asp Pro Asn 
705 710 715 720 

Lys Asn Phe Lys Ala Asp Glu Glu Pro Val Glu Glu Thr Pro Ala Glu 
725 730 735 

Pro Glu Val Pro Gin Val Glu Thr Glu Lys Val Glu Ala Gin Leu Lys 
740 745 750 

Glu Ala Glu Val Leu Leu Ala Lys Val Thr Asp Ser Ser Leu Lys Ala 
755 760 765 

Asn Ala Thr Glu Thr Leu Ala Gly Leu Arg Asn Asn Leu Thr Leu Gin 
770 775 780 

He Met Asp Asn Asn Ser He Met Ala Glu Ala Glu Lys Leu Leu Ala 
785 790 795 800 

Leu Leu Lys Gly Ser Asn Pro Ser Ser Val Ser Lys Glu Lys He Asn 
805 810 815 

Lys Leu Asn 



<210> 9 
<211> 2451 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 9 

atgaaaatta ataagaaata ccttgttggt tctgcggcag ctttgatttt aagtgtttgt 60 
tcttacgagt tgggactgta tcaagctaga acggttaagg aaaataatcg tgtttcctat 120 
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atagatggaa aacaagcgac gcaaaaaacg 
cgtgaaggaa tcaatgctga gcaaatcgtc 
tcacatggcg accactatca ttattacaat 
gaagaattac tcatgaaaga tccaaactat 
gtcaagggtg gatatgttat caaggtagat 
gcccacgcgg ataacgtccg tacaaaagag 
caacatcgtg aaggtggaac tccaagaaac 
ggacgctata ctacagatga tggttatatc 
ggtgatgctt atatcgttcc tcatggagat 
tcagctagcg agttggctgc tgcagaagcc 
tcaagaacct atcgccgaca aaatagcgat 
gtaagcaatc caggaactac aaatactaac 
gcaagtcaaa gtaatgacat tgatagtctc 
caacgacatg tagaatctga tggccttgtc 
gctagaggtg ttgcagtgcc acacggagat 
tctgaattgg aagaacgaat cgctcgtatt 
gtaccagatt caaggccaga acaaccaagt 
ccgcaacctg caccaaatct taaaatagac 
cgaaaagttg gggaaggata tgtattcgaa 
aaagatttac catctgaaac tgttaaaaat 
gtttcacaca ctttaactgc taaaaaagaa 
gataaagcat ataatctgtt aactgaggct 
aattctgatt tccaagcctt agacaaatta 
aaagaaaaat tggtagatga tttattggca 
cttggcaaac caaattctca aattgagtat 
gctgataagt atacaacgtc agatggttac 
gaaggagatg catatgtaac gcctcatatg 
ctttctgata aggaaaaagt tgcagctcaa 
ccatctccag acgcagatgt taaagcaaat 
aatcgtgtga aaggggaaaa acgaattcca 
acagttgagg ttaaaaacgg taatttgatt 
aaatttgctt ggtttgatga tcacacatac 
ttgtttgcga cgattaagta ctacgtagaa 
ggatggggca atgccagtga gcatgtgtta 
aagaacttca aagcggatga agagccagta 
caagtagaga ctgaaaaagt agaagcccaa 
gtaacggatt ctagtctgaa agccaatgca 
ttgactcttc aaattatgga taacaatagt 
ttgttaaaag gaagtaatcc ttcatctgta 

<210> 10 
<211> 819 
<212> PRT 

<213> Streptococcus pneumoniae 
<400> 10 

Met Lys lie Asn Lys Lys Tyr Leu , 
1 5 



gagaatttga ctcctgatga ggttagcaag 180 
atcaagataa cagaccaagg ctatgtcact 240 
ggtaaggttc cttatgacgc tatcatcagt 3 00 
aagctaaaag atgaggatat tgttaatgag 360 
ggaaaatact atgtttacct taaggatgct 420 
gaaatcaatc gacaaaaaca agagcatagt 480 
gatggtgctg ttgccttggc acgttcgcaa 54 0 
tttaatgctt ctgatatcat agaggatact 600 
cattaccatt acattcctaa gaatgagtta 660 
ttcctatctg gtcgaggaaa tctgtcaaat 720 
aacacttcaa gaacaaactg ggtaccttct 780 
acaagcaaca acagcaacac taacagtcaa 84 0 
ttgaaacagc tctacaaact gcctttgagt 900 
tttgatccag cacaaatcac aagtcgaaca 960 
cattaccact tcatccctta ctctcaaatg 1020 
attccccttc gttatcgttc aaaccattgg 1080 
ccacaaccga ctccggaacc tagtccaggc 1140 
tcaaattctt ctttggttag tcagctggta 1200 
gaaaagggca tctctcgtta tgtctttgcg 1260 
cttgaaagca agttatcaaa acaagagagt 1320 
aatgttgctc ctcgtgacca agaattttat 1380 
cataaagcct tgtttgnaaa taagggtcgt 1440 
ttagaacgct tgaatgatga atcgactaat 1500 
ttcctagcac caattaccca tccagagcga 1560 
actgaagacg aagttcgtat tgctcaatta 1620 
atttttgatg aacatgatat aatcagtgat 1680 
ggccatagtc actggattgg aaaagatagc 1740 
gcctatacta aagaaaaagg tatcctacct 1800 
ccaactggag atagtgcagc agctatttac 1860 
ctcgttcgac ttccatatat ggttgagcat 1920 
attcctcata aggatcatta ccataatatt 1980 
aaagctccaa atggctatac cttggaagat 204 0 
caccctgacg aacgtccaca ttctaatgat 2100 
ggcaagaaag accacagtga agatccaaat 2160 
gaggaaacac ctgctgagcc agaagtccct 2220 
ctcaaagaag cagaagtttt gcttgcgaaa 2280 
acagaaactc tagctggttt acgaaataat 2340 
atcatggcag aagcagaaaa attacttgcg 2400 
agtaaggaaa aaataaacta a 2451 



Gly Ser Val Ala Val Leu Ala 
10 is 
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Leu Ser Val Cys Ser Tyr Glu Leu Gly Arg Tyr Gin Ala Gly Gin Asp 
20 25 30 

Lys Lys Glu Ser Asn Arg Val Ala Tyr lie Asp Gly Asp Gin Ala Gly 
35 40 45 

Gin Lys Ala Glu Asn Leu Thr Pro Asp Glu Val Ser Lys Arg Glu Gly 
50 55 60 

He Asn Ala Glu Gin He Val He Lys He Thr Asp Gin Gly Tyr Val 
65 70 75 80 

Thr Ser His Gly Asp His Tyr His Tyr Tyr Asn Gly Lys Val Pro Tyr 
85 90 95 

Asp Ala He He Ser Glu Glu Leu Leu Met Lys Asp Pro Asn Tyr Gin 
100 105 110 

Leu Lys Asp Ser Asp He Val Asn Glu He Lys Gly Gly Tyr Val He 
115 120 125 

Lys Val Asn Gly Lys Tyr Tyr Val Tyr Leu Lys Asp Ala Ala His Ala 
130 135 140 

Asp Asn He Arg Thr Lys Glu Glu He Lys Arg Gin Lys Gin Glu Arg 
145 150 155 160 

Ser His Asn His Asn Ser Arg Ala Asp Asn Ala Val Ala Ala Ala Arg 
165 170 175 

Ala Gin Gly Arg Tyr Thr Thr Asp Asp Gly Tyr He Phe Asn Ala Ser 
180 185 190 

Asp He He Glu Asp Thr Gly Asp Ala Tyr He Val Pro His Gly Asp 
195 200 205 

His Tyr His Tyr He Pro Lys Asn Glu Leu Ser Ala Ser Glu Leu Ala 
210 215 220 

Ala Ala Glu Ala Tyr Trp Asn Gly Lys Gin Gly Ser Arg Pro Ser Ser 
225 230 235 240 

Ser Ser Ser Tyr Asn Ala Asn Pro Ala Gin Pro Arg Leu Ser Glu Asn 
245 250 255 

His Asn Leu Thr Val Thr Pro Thr Tyr His Gin Asn Gin Gly Glu Asn 
260 265 270 
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He Ser Ser Leu Leu Arg Glu Leu Tyr Ala Lys Pro Leu Ser Glu Arg 
275 280 285 

His val Glu Ser Asp Gly Leu He Phe Asp Pro Ala Gin He Thr Ser 
290 295 300 

Arg Thr Ala Arg Gly Val Ala Val Pro His Gly Asn His Tyr His Phe 
305 310 315 320 

He Pro Tyr Glu Gin Met Ser Glu Leu Glu Lys Arg He Ala Arg He 
325 330 335 

He Pro Leu Arg Tyr Arg Ser Asn His Trp Val Pro Asp Ser Arg Pro 
340 345 350 

Glu Glu Pro Ser Pro Gin Pro Thr Pro Glu Pro Ser Pro Ser Pro Gin 
355 360 365 

Pro Ala Pro Ser Asn Pro He Asp Gly Lys Leu Val Lys Glu Ala Val 
370 375 380 

Arg Lys Val Gly Asp Gly Tyr Val Phe Glu Glu Asn Gly Val Ser Arg 
385 390 395 400 

Tyr He Pro Ala Lys Asp Leu Ser Ala Glu Thr Ala Ala Gly He Asp 
405 410 415 

Ser Lys Leu Ala Lys Gin Glu Ser Leu Ser His Lys Leu Gly Thr Lys 
420 425 430 

Lys Thr Asp Leu Pro Ser Ser Asp Arg Glu Phe Tyr Asn Lys Ala Tyr 
435 440 445 

Asp Leu Leu Ala Arg He His Gin Asp Leu Leu Asp Asn Lys Gly Arg 
450 455 460 

Gin Val Asp Phe Glu Ala Leu Asp Asn Leu Leu Glu Arg Leu Lys Asp 
465 470 475 480 

Val Ser Ser Asp Lys Val Lys Leu Val Glu Asp He Leu Ala Phe Leu 
485 490 495 

Ala Pro He Arg His Pro Glu Arg Leu Gly Lys Pro Asn Ala Gin He 
500 505 sio 

Thr Tyr Thr Asp Asp Glu He Gin Val Ala Lys Leu Ala Gly Lys Tyr 
515 520 525 
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Thr Ala Glu Asp Gly Tyr He Phe Asp Pro Arg Asp He Thr Ser Asp 
530 535 540 

Glu Gly Asp Ala Tyr Val Thr Pro His Met Thr His Ser His Trp He 
545 550 555 560 

Lys Lys Asp Ser Leu Ser Glu Ala Glu Arg Ala Ala Ala Gin Ala Tyr 
565 570 575 

Ala Glu Glu Lys Gly Leu Thr Pro Pro Ser Thr Asp His Gin Asp Ser 
580 585 590 

Gly Asn Thr Glu Ala Lys Gly Ala Glu Ala He Tyr Asn Arg Val Lys 
595 600 605 

Ala Ala Lys Lys Val Pro Leu Asp Arg Met Pro Tyr Asn Leu Gin Tyr 
610 615 620 

Thr Val Glu Val Lys Asn Gly Ser Leu He He Pro His Tyr Asp His 
625 630 635 640 

Tyr His Asn He Lys Phe Glu Trp Phe Asp Glu Gly Leu Tyr Glu Ala 
645 650 655 

Pro Lys Gly Tyr Thr Leu Glu Asp Leu Leu Ala Thr Val Lys Tyr Tyr 
660 665 670 

Val Glu His Pro Asn Glu Arg Pro His Ser Asp Asn Gly Phe Gly Asn 
675 680 685 

Ala Ser Asp His Val Gin Arg Asn Lys Asn Gly Gin Ala Asp Thr Asn 
690 695 700 



Gin Thr Glu Lys Pro Ser Glu Glu Lys Pro Gin Thr Glu Lys Pro Glu 
705 710 



715 



720 



Glu Glu Thr Pro Arg Glu Glu Lys Pro Gin Ser Glu Lys Pro Glu Ser 
725 730 735 

Pro Lys Pro Thr Glu Glu Pro Glu Glu Ser Pro Glu Glu Ser Glu Glu 
74 0 745 750 

Pro Gin Val Glu Thr Glu Lys Val Glu Glu Lys Leu Arg Glu Ala Glu 
7 55 760 765 



Asp Leu Leu Gly Lys He Gin Asp Pro He He Lys Ser Asn Ala Lys 
770 775 780 
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Glu Thr Leu Thr Gly Leu Lys Asn Asn Leu Leu Phe Gly Thr Gin Asp 
785 790 795 



800 



Asn Asn Thr He Met Ala Glu Ala Glu Lys Leu Leu Ala Leu Leu Lys 
805 810 815 



Glu Ser Lys 



<210> 11 
<211> 2531 
<212> DNA 

<213> Streptococcus pneumoniae 
<400> 11 

atgaaaatta ataaaaaata tctagcaggt 
tcctatgagc ttggacgtta ccaagctggt 
tatatagatg gtgatcaggc tggtcaaaag 
aagagggagg ggatcaacgc cgaacaaatt 
acctctcatg gagaccatta tcattactat 
agtgaagagc tcctcatgaa agatccgaat 
gaaatcaagg gtggttatgt cattaaggta 
gcrgctcatg cggataatat tcggacaaaa 
agtcataatc ataactcaag agcagataat 
tatacaacgg atgatgggta tatcttcaat 
gcttatatcg ttcctcacgg cgaccattac 
agcgagttag ctgctgcaga agcctattgg 
agttctagtt ataatgcaaa tccagctcaa 
gtcactccaa cttatcatca aaatcaaggg 
tatgctaaac ccttatcaga acgccatgtg 
caaatcacaa gtcgaaccgc cagaggtgta 
atcccttatg aacaaatgtc tgaattggaa 
tatcgttcaa accattgggt accagattca 
ccagaaccta gtccaagtcc gcaaccagct 
aaagaagctg ttcgaaaagt aggcgatggt 
tatatcccag ccaaggatct ttcagcagaa 
aagcaggaaa gtttatctca taagctagga 
cgagaatttt acaataaggc ttatgactta 
aataaaggtc gacaagttga ttttgaggct 
gtctcaagtg ataaagtcaa gttagtggaa 
catccagaac gtttaggaaa accaaatgcg 
gtagccaagt tggcaggcaa gtacacagca 
ataaccagtg atgaggggga tgcctatgta 
aaaaaagata gtttgtctga agctgagaga 
ggtttgaccc ctccttcgac agaccatcag 
gaagctatct acaaccgmgt gaaagcagct 



tcagtggcag tccttgccct aagtgtttgt 60 
caggataaga aagagtctaa tcgagttgct 120 
gcagaaaact tgacaccaga tgaagtcagt 180 
gttatcaaga ttacggatca aggttatgtg 24 0 
aatggcaagg ttccttatga tgccatcatc 3 00 
tatcagttga aggattcaga cattgtcaat 360 
aacggtaaat actatgttta ccttaaggat 420 
gaagagatta aacgtcagaa gcaggaacgc 480 
gctgttgctg cagccagagc ccaaggacgt 540 
gcatctgata tcattgagga cacgggtgat 600 
cattacattc ctaagaatga gttatcagct 660 
aatgggaagc agggatctcg tccttcttca 720 
ccaagattgt cagagaacca caatctgact 780 
gaaaacattt caagcctttt acgtgaattg 840 
gaatctgatg gccttatttt cgacccagcg 900 
gctgtccctc atggtaacca ttaccacttt 960 
aaacgaattg ctcgtattat tccccttcgt 1020 
agaccagaag aaccaagtcc acaaccgact 1080 
ccaagcaatc caattgatgg gaaattggtc 114 0 
tatgtctttg aggagaatgg agtttctcgt 1200 
acagcagcag gcattgatag caaactggcc 1260 
actaagaaaa ctgacctccc atctagtgat 132 0 
ctagcaagaa ttcaccaaga tttacttgat 1380 
ttggataacc tgttggaacg actcaaggat 144 0 
gatattcttg ccttcttagc tccgattcgt 1500 
caaattacct acactgatga tgagattcaa 1560 
gaagacggtt atatctttga tcctcgtgat 1620 
actccacata tgacccatag ccactggatt 168 0 
gcggcagccc aggcttatgc traagagaaa 174 0 
gattcaggaa atactgaggc aaaaggagca 1800 
aagaaggtgc cacttgatcg tatgccttac 1860 
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aatcttcaat atactgtaga agtcaaaaac 
taccataaca tcaaatttga gtggtttgac 
actcttgagg atcttttggc gactgtcaag 
cattcagata atggttttgg taacgctagc 
gctgatacca atcaaacgga aaaaccaagc 
gaagaaaccc ctcgagaaga gaaaccgcaa 
gaggaaccag aagaatcacc agaggaatca 
gaagaaaaac tgagagaggc tgaagattta 
tccaatgcca aagagactct cacaggatta 
aacaatacta ttatggcaga agctgaaaaa 
aggtagaagc ttaagggcga atttggcacc 
gaaaaactat t 



ggtagtttaa tcatacctca ttatgaccat 1920 
gaaggccttt atgaggcacc taaggggtat 1980 
tactatgtcg aacatccaaa cgaacgtccg 2040 
gaccatgttc aaagaaacaa aaatggtcaa 2100 
gaggagaaac ctcagacaga aaaacctgag 2160 
agcgagaaac cagagtctcc aaaaccaaca 2220 
gaagaacctc aggtcgagac tgaaaaggtt 2280 
cttggaaaaa tccaggatcc aattatcaag 234 0 
aaaaataatt tactatttgg cacccaggac 24 00 
ctattggctt tattaaagga gagtaagtaa 2460 
caggacaaca atactattat ggcagaagct 2520 

2531 
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